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FORM PTCM390 
(RJEV 11-98) 



U.S DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE 

TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEYS DOCKET NUMBER 

410.020 



U.S. APPLICATION NO. (If known, see 37 CFR 1.5) 

09/980771 



INTERNATIONAL APPLICATION NO. 

POT /FR00/0 1384 



INTERNATIONAL FILING DATE' 

May 19, 2000 



PRIORITY DATE CLAIMED 

May 21, 1999 

UMiiNT POLYPEPTIDE OF INTEREST 



TITLE OF INVENTION STARCH GRAN ULES IWA11MINU A KHJUW. 
a wmon nw ortawno T URM r AND THEIR USES 



APPLICANT(S) FOR DO/EO/US 

D'HULST et al 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 



i. La 

5. ra 



io. □ 



This is a FIRST submission of items concerning a filing under 35 U.S.C 371. 

This is a SECOND or SUBSEQUENT submission of items concerning a filing urrtfer 35 U.S.C. 371. 

This express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than delay 

examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and 39(1). 

A proper Demand for International Preliminary Examination was made by the 19th month from the earliest claimed priority date. 

v A'copy of the International Application as filed (35 U.S.C. 371(c)(2)) 

a. Q| is transmitted herewith (required only if not transmitted by the International Bureau). 

b. Q has been transmitted by the International Bureau. 

c. Q is not required, as the application was filed in the United States Receiving Office (RO/US). 
A translation of the International Application into English (35 U.S.C. 371(c)(2)). 

Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3)) 

a. Q are transmitted herewith (required only if not transmitted by the International Bureau). 

b. have been transmitted by the International Bureau. 

c. Q have not been made; however, the time limit for making such amendments has NOT expired. 

d. Q have not been made and will not be made. 

A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 
An oath or declaration of the inventory) (35 U.S.C. 371(c)(4)). Unexecuted. 

A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 
(35 U.S.C. 371(c)(5)). 



Items 11. to 16. below concern document(s) or information included: 

An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 
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fNTERNAKONAL APPLICATION NO 

FCT/FROO/01^84 



ATTORNEY'S DOCKET NUMBER 



17. Q The following fees are submitted: 

BASIC NATIONAL FEE (37 CFR 1.492 (a) (1) - (5) ) : 

Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1 .445(aX2)) paid to USPTO 
and International Search Report not prepared by the EPO or JPO 3>y7U.uu 

International preliminary examination fee (37 CFR 1 .482) not paid to 

USPTO but International Search Report prepared by the EPO or JPO $840.00 

International preliminary examination fee (37 CFR 1.482) not paid to USPTO 

but international search fee (37 CFR 1.445(a)(2)) paid to USPTO $760.00 

International preliminary examination fee paid to USPTO (37 CFR 1.482) 

butall claims did not satisfy provisions of PCT Article 33(l)-(4) $670.00 

International preliminary examination fee paid to USPTO (37 CFR 1 .482) ^ 
and all claims satisfied provisions of PCT Article 33(1 )-(4) $96.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS pro use only 



$1040.00 



$L040,00 



Surcharge of $130.00 for furnishing the oath or declaration later than Q 20 [~~| 30 
months from the earliest claimed priority date (37 CFR 1.492(e)). 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



Total claims 



?1 



-20 



X $18.00 



18.00 



independent claims 



1 -3 - 



X $78.00 



MULTIPLE DEPENDENT CLAIM(S) (if applicable) 



+ $260.00 



TOTAL OF ABOVE CALCULATIONS = 



1058,00 



Reduction of 1/2 for filing by small entity, if applicable. A Small Entity Statement 
must also by filed (Note 37 CFR 1.9, 1.27, 1.28). 



SUBTOTAL = 



1058.00 



Processing fee of $130.00 for furnishing the English translation later than Q20 | | 30 
months from the earliest claimed priority date (37 CFR 1 -492(f))- + 



TOTAL NATIONAL FEE = 



$ 1058.00 



Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by an appropriate cover sheet (37 CFR 3.28, 3.3 1). $40.00 per property 



TOTAL FEES ENCLOSED = 



5 1058.00 



Amount to be: 
refunded 



charged 



b. O Please charge my Deposit Account No. 



in the amount of $ 



to cover the above fees 



A duplicate copy of this sheet is enclosed. 



j~3 The Commissioner is hereby authorized to charge_any additional fees which may be required, or credit any 
overpayment to Deposit Account No. 02-227 5 . A duplicate copy of this sheet is enclosed. ■ 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137(a) or (b)) must be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO 

Bierrnan., Muserlian and Lucas 
600 Third Avenue 
New York, NY 10016 
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Our Ref . : 410. 020 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re Application of: : 
D'HULST et al : 

PCT/FR00/01384 : PCT Date: May 19, 2000 

Serial No. : : 

Filed: Concurrently Herewith : 

For: STARCH GRANULES ... AND THEIR : 
USES : 

600 Third Avenue 
New York, NY 10016 



PRELIMINARY AMENDMENT 



Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Please amend this application as follows: 



IN THE SPECIFICATION: 



Page l, before line 1, insert , . 

—This application is a 371 of PCT/FR00/01384 filed May 
19, 2000. — 



IN THE CLAIMS : 



Claim 1 (amended) A recombinant nucleotide sequence 
containing, in the 5'->3' direction, a nucleotide sequence coding 
for an adenosine diphosphate glucose a-1, 4-glucan oe-4- 
glucosyltransf erase or starch synthase EC 2.4.1.21, or for a 



protein derived from this enzyme, by suppression, addition or 
substitution of at least one amino acid, the said enzyme or derived 
protein having the property of migrating to the sites of 
biosynthesis of the starch granules in plant cells and of attaching 
to the starch granules, the said nucleotide sequence coding for the 
enzyme or aforementioned protein being positioned upstream of a 
nucleotide sequence coding for a peptide or polypeptide of 
interest. 

Claim 2 (amended) A recombinant nucleotide sequence of Claim 
1, wherein the nucleotide sequence coding for a starch synthase, or 
for a derived protein, codes for the starch synthase bound to the 
starch granule or GBSS present in plants, algae or micro-algae. 

Claim 3 (amended) A recombinant nucleotide sequence of Claim 
1 wherein the nucleotide sequence coding for a starch synthase, or 
for a derived protein, is selected from the group consisting of 

the nucleotide sequence SEQ ID NO : 1 of the cDNA coding 
for the GBSSI of Chlamydomonas reinhardtii, 

or a fragment of the nucleotide sequence SEQ ID NO : 1. 
shown in which the nucleotide of the 5' end corresponds to that 
located in one of the positions 1 to 186 of SEQ ID NO : 1, and in 
which the nucleotide of the 3 ' end corresponds to that located in 
one of the positions 1499 to 3117 of SEQ ID NO : 1. 

Claim 4 (amended) A recombinant nucleotide sequence of 
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Claim 1, wherein the nucleotide sequence coding for a peptide or 
polypeptide of interest is selected from the group consisting of 

those encoding biologically active peptides, especially 
peptides of therapeutic interest or that can be used in the 
agricultural and food industry, and 

those encoding enzymes that are able to transform starch. 

Claim 5 (amended) A recombinant nucleotide sequence of 
Claim 1, wherein it contains a nucleotide sequence encoding a 
cleavage site, the said nucleotide sequence being positioned 
between the nucleotide sequence coding for a starch synthase, or a 
protein derived from the latter, and the nucleotide sequence 
encoding the polypeptide of interest. 

Claim 6 (amended) A transgenic plant cell selected from the 
group consisting of cells of plants, algae and micro-algae, that 
are able to produce starch, the said cells containing a recombinant 
nucleotide sequence of Claim 1 integrated in its genome or 
maintained in a stable manner in its cytoplasm . 

Claim 7 (amended) A member of the group consisting of 
transgenic plants , algae , micro-algae , flowers , fruits , leaves , 
stems, roots, seeds, and fragments of these plants, algae or micro- 
algae, containing a recombinant nucleotide sequence of Claim 1 
integrated in the genome or maintained in a stable manner in the 
cytoplasm of the cells of which they are composed. 



Claim 8 (amended) A fusion polypeptide, containing 

- in the N-terminal position, a starch synthase, or a protein 
derived from this enzyme, especially by suppression, addition or 
substitution of one or more amino acids, the said starch synthase 
or derived protein having the property of migrating to the sites of 
biosynthesis of the starch granules in plant cells and of attaching 
to the starch granules, 

- and, in the C-terminal position, a peptide or polypeptide of 
interest, 

the C-terminal part of the amino acid sequence of the starch 
synthase, or of the derived protein, thus being bound to the N- 
terminal part of the peptide sequence of interest, the said fusion 
polypeptide being encoded by a recombinant nucleotide sequence of 
Claim 1. 

Claim 9 (amended) A fusion polypeptide of Claim 8, wherein 
the starch synthase is selected from the group consisting of 

the peptide sequence SEQ ID NO : 3 corresponding to the 
GBSSI of Chlamydomonas reinhardtii in the form of pre-protein of 
708 amino acids, 

or a fragment of the peptide sequence SEQ ID NO : 3, 
wherein the sequences in which the amino acid of the amino terminal 
end corresponds to that located in one of the positions 1 to 58 of 
SEQ ID NO : 3, and in which the amino acid of the carboxy terminal 
end corresponds to that located in one of the positions 495 to 708 
of SEQ ID NO : 3 selected from the group consisting of 



• the sequence SEQ ID NO : 5 corresponding to the GBSSI 
of Chlamydomonas reinhardtii in the form of mature protein of 651 
amino acids, 

- the sequence SEQ ID NO : 7 corresponding to a fragment of 
438 amino acids of the peptide sequence of the GBSSI of 
Chlamydomonas reinhardtii, 

• the sequence SEQ ID NO : 9 corresponding to a fragment 
of 531 amino acids of the peptide sequence of the GBSSI of 
Chlamydomonas reinhardtii , 

and a peptide sequence derived from an aforementioned 
peptide sequence or fragment, by substitution, suppression or 
addition of at least one amino acid, and having the property of 
attaching to the starch granules, the said derived peptide sequence 
preferably having a homology of at least about 60%, with the 
aforementioned peptide sequence or fragment. 

Claim 10 (amended) A fusion polypeptide of Claim 8 wherein it 
contains a cleavage site positioned between, on the one hand, the 
starch synthase, or a protein derived from the latter, and, on the 
other hand, the polypeptide of interest. 

Claim 11 (amended) A starch granule, containing at least one 
fusion polypeptide of Claim 8. 

Claim 12 (amended) A pharmaceutical composition comprising a 
starch granule containing at least one fusion polypeptide of Claim 



8 and a physiologically acceptable vehicle, the peptide of interest 
in the said fusion polypeptides possessing a defined therapeutic 
effect. 

Claim 13 (amended) A pharmaceutical composition of Claim 12, 
wherein the diameter of the starch granules is between about 0.1 /xm 
and several tens of /an, and the proportion by weight of the fusion 
polypeptides in these granules is between about 0.1% and 1%. 

Claim 14 (amended) A pharmaceutical composition, containing 
at least one fusion polypeptide of Claim 8 and a physiologically 
acceptable vehicle, the peptide of interest in the said fusion 
polypeptides possessing a defined therapeutic effect. 

Claim 15 (amended) A food composition containing starch 
granules of Claim 11, the peptide of interest in the said fusion 
polypeptides being usable in the food-processing field. 

Claim 16 (amended) A method of preparation of starch granules 
of Claim 11, comprising 

transformation of plant cells, by means of a cellular 
host, or transformed by a recombinant vector, containing a 
recombinant nucleotide sequence of Claim 1, 

- obtaining plants, algae or micro-algae transformed so that 
their genome contains at least one or more nucleotide sequences of 



Claim 1, by in vitro culture of the aforementioned transformed host 
cells, 

- optionally fertilization and recovery of the seeds of the 
plants obtained in the preceding stage, and cultivation of these 
seeds to obtain plants of the next generation and 

- extraction of the starch granules from a member of the group 
consisting of the plants, algae or micro-algae, flowers, fruits, 
leaves, stems, roots, or fragments of these aforementioned 
transformed plants, algae or micro-algae. 

Claim 17 (amended) A method of preparation of fusion 
polypeptides of Claim 8 by the method of Claim 16 comprising an 
additional stage of recovery, and optionally, purification of the 
fusion polypeptides from the starch granules . 

Claim 18 (amended) A method of preparation of a peptide by 
the method of Claim 16 by transformation of host cells with the 
nucleotide sequences of Claim 5, and includes an additional stage 
of cleavage of the fusion polypeptide obtained, by means of a 
suitable reagent, then, optionally, a stage of purification of the 
polypeptide of interest. 

Claim 19 (amended) A method of biotransformation of starch 
granules , comprising 

transformation of plant cells by means of a cellular 
host, or transformed by a recombinant vector, containing a 



recombinant nucleotide sequence of Claim 4, encoding enzymes that 
are able to transform starch, 

- obtaining plants, algae or micro-algae transformed so that 
their genome contains at least one aforementioned nucleotide 
sequence, by in vitro culture of the aforementioned transformed 
host cells, 

- optionally, fertilization and recovery of the seeds of the 
plants obtained in the preceding stage, and cultivation of these 
seeds to obtain plants of the next generation, 

- extraction of the starch granules from a member of the group 
consisting of the plants, algae, micro-algae, flowers, fruits, 
leaves, stems, roots, or fragments of these aforementioned 
transformed plants, algae or micro-algae, 

- optionally heating of the said starch granules to a 
temperature at which the peptide of interest of the fusion 
polypeptide is capable of being active. 

PLEASE ADD THE FOLLOWING CLAIMS : 

— 20. A recombinant nucleotide sequence of Claim 3 selected 
from the group consisting of 

• the sequence SEQ ID NO : 2 coding for the GBSSI of 
Chlamydomonas reinhardtii in the form of pre-protein of 708 amino 
acids (SEQ ID NO : 3) , 

the sequence SEQ ID NO : 4 coding for the GBSSI of 
Chlamydomonas reinhardtii in the form of a mature protein of 651 
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amino acids (SEQ ID NO : 5) , 

♦ the sequence SEQ ID NO 
amino acids (SEQ ID NO : 7) 
reinhardtii , 

• the sequence SEQ ID NO 
amino acids (SEQ ID NO : 9) 
reinhardtii , 



: 6 coding for a fragment of 438 

of the GBSSI of Chlamydomonas 

: 8 coding for a fragment of 531 

of the GBSSI of Chlamydomonas 



a nucleotide sequence derived by degeneration of the 
genetic code of the aforementioned nucleotide sequences, and coding 
for the aforementioned GBSSI of Chlamydomonas reinhardtii, or for 
an aforementioned peptide fragment of the latter, 

- a nucleotide sequence derived from a nucleotide sequence 
or fragment mentioned above, especially by substitution, 
suppression or addition of one or more nucleotides, and coding for 
a peptide sequence derived from the aforementioned GBSSI of 
Chlamydomonas reinhardtii , or derived from an aforementioned 
peptide fragment of the latter, and having the property of 
attaching to the starch granules, the said derived nucleotide 
sequence preferably having a homology of at least about 50%, and 
preferably of at least about 70%, with the aforementioned 
nucleotide sequence or fragment, 

- and a nucleotide sequence capable of hybridizing with one 
of the aforementioned nucleotide sequences or fragments. 

21. A sequence of Claim 4 wherein the enzyme is one that 
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interacts with a-glucans including various hydrolases, 
phosphorylases, a-1,4 glucanotransf erases, branching enzymes, 
amylases, and heat-resistant hydrolases obtained from 
extremophiles . — 



REMARKS 



The amendment is submitted to insert reference to the PCT 
application, to remove multiple dependency from the claims and to 
conform the claims to the American practice. 



Respectfully submitted, 
BIERMAN, MUSERLIAN AND LUCAS 




Charles' A. Muserlian, #19 , 683 
Attorney for Applicant (s) 
Tel. # (212) 661-8000 



CAM:sd 

Enclosures: Marked-Up Version of Claims 
Return Receipt Postcard 
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MARKED-UP VERSION OF 
CLAIMS 

1* A recombinant nucleotide ^^"f^ characterized^ 'that it uunla t iiih in Ihc 
y->y direction, a nucleotide sequence coding for an adenosine diphosphate, glucose a- 
1,4-glucan a~4-glucosyltransferase or starch synthase EC 2.4.1.21, or fur a pro^i^ 
derived from this enzyme, esp®2£aH.y by suppression, addition or substitution of/bne * 
*«rfe amino acid^, the said enzyme or derived protein having the property of migrating 
io the sites of biosynthesis of the starch granules in plant cells and of attaching xo the 
starch granules, the said nucleotide sequence coding for the enzyme or aforementioned 
protein being positioned upstream of a nucleotide sequence coding for a peptide or 
polypeptide of interest. 

2. A recombinant nucleotide sequence aeeordm|pto Claim U ch»^tcri7^ii m 
a^tt the nucleotide sequence coding for a starch synthase, or for a derived protein, codes 

for the starch synthase bound to the starch granule or GBSS present ia^sa^teaafcir in 
plants, algae or micro-algac, » H ™rr r^ jaU y for the isnfbtm GBSSfe or for a 
^piniein-d erivcd from GBSS as d e fined in Cla i m L 

3. A recombinant nucleotide sequence *G€e*=diftf"to Claim 1 or- 2, characterized 
i*4ta* the nucleotide sequence coding for a starch synthase, or for a derived protein, is 
selected fromtL^ <^^f co^'V^ ^ 

- the nucleotide sequence SEQ ID NO . 1 of the cDNA coding for the GBSSI of 
CMamydomonas reinhardtii, 

. or a fragment of the nucleotide sequence SEQ ID NO : 1 shown^^ueJo^ie-Ae- 
s^ttene^s in which the nucleotide of the 5' cud corresponds to that located in one of the 
positions 1 to 186 of SEQ ID NO : L and in which the nucleotide of the _T end 
corresponds to that located in one of the positions 1499 to 3117 of SEQ ID NO : 1, 

the sequence SEQ ID NO : 2 coding for the GBSSI of CMamydomonas 
reinhardtii in the form of pre protein of 708 amino acids {SEQ ID NO : 3), 

the sequence SEQ ID NO : 4 coding for the GBSSI of CMamydomonas 
reinhardtii in the form of mature protein of 651 amino acids (SKQ TD NO : 5). 

. the sequence SEQ ID NO : 6 coding for a fragment of 438 amino acids 
(SEQ ID NO ; 7) of ihc GBSSI of Chlamydomonas reinhardziU 



Fax cwi3 par 
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. the sequence SEQ ID NO : 8 coding for a fragment of 531 amino acids 
(SEQ ID NO : 9) of the GBSSI of Chlamydomonas reinhardtiU 

- jfy a nucleotide sequence derived by degeneration of the genetic code of the 
aforementioned nucleotide sequences, and coding tor the aforementioned GBSSI of 

S Chlamydomonas reinhardtii y or for an aforementioned peptide fragment of the latter, 

- m a nucleotide sequence derived from a nucleotide sequence or fragment 
mentioned above, especially by substitution, suppression or addition of one or more* 
nucleotides, and coding for a peptide sequence derived from the aforementioned GBSSI 
of Chlamydomonas reinhardtii, or derived from an aforementioned peptide fragment of 

10 the latter, and having the property of attaching to the starch granules, the said derived 

nucleotide sequence preferably having a homology of at least about 50%, and preferably 
of at least about 70%, with the aforementioned nucleotide sequence or fragment, 

- af a nucleotide sequence capable of hybridizing with one of the 
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aforementioned nucleotide sequences or fragments. 



4. A recombinant nucleotide sequence according to one of tW Claim! 1 t&*%r 



• rhflfflfltrr^cH in r h nt the nucleotide sequence coding for a peptide or polypeptide of 

interest is selected from.t^ <j/*>~~-p <u^ t^ &f 

- those encoding biologically active peptides, especially peptides of therapeutic 

20 interest or that can be used in the agricultural and food industry, fT^J^ ^ ^ ^ 

/ those encoding enzymes that are able to transform starclu ssetwa enzymea/that 

ju^Kib^- . Z- ~> 1 

^i^^Y^J^^ interact with a-glucans including various hydrolases, phosphorylases, a-1,4 

' ] glucanotransferases, branching enzymes, amylases, and espe^aBy heat-resistant 

hydrolases obtained from extremnphiles swb-^^ardiatbiick-iia dial ait.' active at J 

25 te piperaturea -a kovc 10 p C t 

5. A recombinant nucleotide sequence a cc o r d ing to ono - of fee Claimf 1 4o 4, 
^h ^ractcrizcdJ ±tJfeat it contains a nucleotide sequence encoding a cleavage site, the said 
nucleotide sequence being positioned between the nucleotide sequence coding for a 

10 ^larch synthase, or a protein derived from the latter, and the nucleotide sequence 

encoding the polypeptide of interest. / 

6. f\ transgenic plant cell^selected from thc/cells^of plants, algae micro- 
algae, thai are able to produce starch, the said cells containing a recombinant nucleotide 
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sequence ^according to eae of the-Claim* 1 tq^jT integrated in its genome or maintained 
in a stable manner in its cytoplasm. 

7* J ^Tansgenic plants, algae & micro-algae, ar^a as, espec i ally flowers, fruits, 
leaves, stems, roots, seeds, Sr^fragments of these plants, algae or micro-algae, 
containing a recombinant nucleotide sequence a cco r ding to on e of Use Claimf 1 46gS> 
integrated in the genome or maintained in a stable manner in the cytoplasm of the cells 
of which they are composed. 

8. A fusion polypeptide, i ^flrnc.tcrizcd in that it -contain 

- in the N-tcrminal position, a starch synthase, or a protein derived fiom this 
enzyme, especially by suppression, addition or substitution of one or more amino acids, 
the said starch synthase or derived protein having the property of migrating to the sites 
of biosynthesis of the starch granules in plant cells and of attaching to the starch 
granules, 

- and, in the C-terminal position, a peptide or polypeptide of interest, 

the C-tenninal pait of (he amino acid sequence of the starch synthase, or of the 
derived protein, thus being bound to the N-terminal part of the peptide sequence of 
interest, the said fusion polypeptide being encoded by a recombinant nucleotide 
sequence avoiding tu uiic- of-tfee Claim:/ 1 



$4 & kt-s*i-i£> 

9- A fusion polypeptide. gccording-*o Claim 8, eharaot o rhiud in that the starch 
synthase is selected fromtl>- ^*Mny> cc-^^^f ^ 

- the peptide sequence SEQ ID NO: 3 corresponding to the GBSST of 
Chlamydomonas rcinhardiii in the form of pre-protein of 708 amino acids, 

- or a fragment of the peptide sequence SEQ TD NO : 3, Isuc^'the sequences in 
which the amino acid of the amino terminal end corresponds to that located in one of 
the positions 1 to 58 of SEQ ID NO : 3, and in which the amino acid of the carboxy 
terminal end corresponds to that located in one of the positions 495 to 708 of SEQ ID 
NO : 3. eqvrva^^ — ih-*\w^y> ^^yid^ 

. the sequence SEQ ID NO : 5 corresponding to the GBSSI of 
Chlamydomonas reinhardtii in the form ormalure protein of 651 amino acids, 

. the sequence SFQ TD NO • 7 corresponding to a fragment of 438 amino 
acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii^ 
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. the sequence SEQ ID NO : 9 corresponding to a fragment of 531 amino 
acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii, 
- j^r^Tpcptidc sequence derived from an aforementioned pc^d^, sequence or 
fragment, ^spe&agy by substitution, suppression or addition of J one Quaere amino 
;ici<1s, and having the property of attaching to the starch granules, the said derived 
peptide sequence preferably having a homology of al least about 60%, aad — 
a 4vantagcously - at l e ast about - 80% . with the aforementioned peptide sequence or 
fragment. 

10* A fusion polypeptide j £Cording - t r > Claim % ni 0, i.1iau:uTii!i iauJ in llidfr it 
contains a cleavage site positioned between, on the one hand, the starch synthase, or a 
proiein derived from the latter, and, on the other hand, the polypeptide of interest. 



11./ Starch granul<jS, rt m r nrtrrir > ed in^th n t th n y r n nt n 



zontein one or r more fusion 

polypeptide defined in one of the Claims 8 terttT 

12, A pharmaceutical composition, ^baiaA™ j ^cdr in that ii.^ a«4affis slarch 

granule/ ^ac^^ 1L- if Hg gg gs aty in combin ation*^! a physi ologically 

acceptable vehicle, fe e - said gr i aiirf ^^^ ^i^^on e o^m &^e fus ion polypeptide^ <M^& 
r i r fi ]1 r d in nnr nf fhr B fn ia the peptide of interest in Ihc said fusion 

polypeptides possessing a defined therapeutic effect. 



13. A pharmaceutical composition according t o Claim 12, efra raeterized in that - 

form that can he idmrnkflmf^ diameter of the starch granules bsaag between 
about 0.1 pm and several tens of pm, and the proportion by weight of the fusion 
polypeptides in these granulesi<£»g between about 0.1% and 1%. 

14. A pharmaceutical composition, characterized in tha^ t- contain? one m jitwc"^ 
fusion polypeptides as dftfined-jflrone of th*Claim^8 to 10, i f ne^eg a ry - in combination 
vcjik a physiologically acceptable vehicle, the peptide of interest in the said fUsion 
polypeptides possessing a defined therapeuiic effect. 
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ir 7 

15. A food rnmprrefrinn^ ^ptM^trml m . rtml \\ COHtaln^ StaTCh gTailUlCS 

Tirrordim^ to Claim 1 1, the-saitl g ran n i es co ntaining one or mmr fiwinn polypeptide or 
tjefin ^d in one of fh<fl Claims 8 to 10 , the peptide of interest in the said fusion 
polypeptides being usable in the food-processing field. 




16- A method of preparation of starch granules ac c o rding - to Claim 11, 
t frrtrartffrrml in ftwj4t comprise s the following stages : 

- transformation of plant cells, hy means of a cellular host, * *m . U as- 
A^ ohacterium tumcfaciom y transformed by a recombinant vector, espe cially bf th e- 
pJaamd , rosm i d . or phage type^ containing a recombinant nucleotide sequence 
according tr> nno of fbt Claim| I 



So 

- obtaining plants, algae or micro-algae transformed in s uch a wa y that their 
genome coniainsyone or more nucleotide sequences ae cordmg to one of the-Clainu 1 ts^ 
^ by in vitro culture of the aforementioned transformed host cells, 

- >£ nc i fc55a^ ^eitiHzation and recovery of the seeds of the plants obtained in the 
preceding stage, and cultivation of these seeds to obtain plants ot the next generation*^ 

- extraction of the starch granules fromfthe plants, algae or micro-alga©, or from-* - 
ffegj**- ^[inririlly flowers, fruits, leaves, sterna, roots, or fragments of these 
aforementioned transformed plants, algae or micm-algac. .espedajly hy sedimapiatinn 



17, A method of preparation of fusion polypeptides jicc srdiugQr -arie of lire 
Claimfc 8 te - 10, characterized in tha t iLin cludes the implementatio n rft f the metho d 
, arcm-ding - to Cla i m i6^me said method comprising an additional stage of recovery, and 
if nPcesgaTVyfrf purification, of ihe fusion polypeptides from the starch granules. 

18- A method of preparation of a peptide o £ interest, q hq ractc jirrH in thnt it 
includesjl ie i mplemgntation^f ' the method ascordl^^to Claim 1 6 en Claim 17r4he-said 
m ethod hainfl r.arrjg d-o*it by transformation of host cells with the nucleotide sequences 
aeeopdittg-S^ Claim 5, and includes an additional stage of cleavage of the fusion 
polypeptide obtained, by means of a suitable reagent, then, if ff<rcSsa& a stage of 
purification of the polypeptide of interest. 

19. A method of biotransformation of starch granules, ebar aetcrized in that t t 
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- transfonnation of plant cells* by means of a cellular host, ,^uch -gts 
Aigrnhnrterhim umofnrioia^ transformed by a recombinant vector, ^ftp cfl iHIy o f th e 
filas mid, cosmid or phapp; tv|v t containing a recombinant nucleotide sequence 
aeeefdia|fc> Claim 4, encoding enzymes that are able to transform starch, 

- obtaining plants, algae or micro-algae transformed 4a such a way that their 
genome contamspne jnuw aforementioned nucleotide sequences, by in vitro culture 
of the aforementioned transformed host cells, 

- iLnrrryrtfy, fertilization and recovery of the floods of the plants obtained in the 
preceding stage, and cultivation uf these seeds to obtain plants of the next generation, 

- extraction ot the starch granules fromjthe plants, algae m micro-algae, €*-J*e«r 
p arte r o opocially flowers, fruits, leaves, stems, roots, or fragments of these 
aforementioned transformed plants, algae or micro-algae, especially by ^edijueuldtiui t; 

- if - nfee sa arffi heating of the said sfarrh granules to a temperature at which the 
peptide of interest of the fusion polypeptide is capable of being active. 
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STARCH GRANULES CONTAINI NGA RECOMBINANT POLYPEPTID ES^ 
INTEREST, A METHOD OF OBTAINING THEM, AND THEIR USES 



The present invention relates to starch granules containing a recombinant 
polypeptide of interest, a method of obtaining them, as well as their uses, especially in 
pharmaceutical compositions. 

Starch is one of the world's most important sources of polysaccharides, occurring 
in particular in plants (maize, potato, wheat, rice, barley, etc.), algae, micro-algae etc. 
Starch occurs in the form of granules that are insoluble in water, the size of which can 
vary from 0.1 to several tens of |im in diameter depending on its origin (plants, algae or 
micro-algae) or even the genotype of the plant in question. Thus, the sizes of these 
granules vary from 0.1 pm in diameter to more than 50 jam in diameter. Furthermore, 
the degrees of crystallinity of these granules range from 0% (for granules rich in 
amylose) to over 30%. There are three or four crystalline types (A, B, C, V). The 
granule grows by the laying-down of alternately amorphous and semicrystalline layers 
starting from the centre of the starch granule. 

Starch contains several distinct polysaccharide fractions, composed of glucans 
bound at a- 1,4 and branched at a- 1,6. More particularly, starch consists of two glucose 
polymers: amylose on the one hand, the minor fraction of the granule (about 20-30 
wt.%), of low molecular weight, with little branching (< 1% of a- 1,6 bonds) and 
amylopectin on the other hand, the major fraction of the granule (70-80 wt.%), of high 
molecular weight and highly branched (5% of a- 1,6 bonds). Amylose is not necessary 
for the development of crystallinity of the starch granule; it is now known that it is 
amylopectin that is responsible for the crystallinity of the starch granule. 

In biological terms, strictly speaking starch only occurs in the plant kingdom, and 
more specifically in the chloroplasts or in the non-photo synthetic granules of the 
eukaryotic plant cell. Two types of starch can be synthesized by plants: temporary or 
photo synthetic starch (synthesis of which takes place at the level of the chloroplasts), 
and reserve starch (synthesis of which takes place at the level of the amylop lasts). 

Synthesis of starch in plants involves a whole panoply of enzymes taking part in 
biosynthesis of the precursor ADP-glucose, scaffolding of the amylose and amylopectin 
molecules and, finally, degradation of the starch granule. 



The first stage in the biosynthesis of starch is the production of the precursor 
ADP-glucose with the involvement of the two enzymes: phosphoglucomutase (PGM) 
and ADP-glucose pyrophosphorylase (AGPase). 

The second stage in the biosynthesis of the starch granule also involves two types 
of enzymes, mainly taking part in the synthesis of amylose and amylopectin: starch 
synthases (or adenosine diphosphate glucose a-l,4-glucan a-4-glucosyltransferases) 
and branching enzymes (or a-l,4-glucan-6-glucosyltransferases). The starch synthases 
catalyse the transfer of the glucose residue from ADP-glucose onto growing chains of 
glucans by creating an O-glycosidic bond of type a- 1,4. Then the branching enzymes 
hydro lyse an a- 1,4 bond of an elongating glucan, and then join the fragment thus 
released onto the remainder of the glucan by means of an a- 1,6 bond. 

With regard to starch degradation, there are two main families of degrading 
enzymes: the hydrolytic enzymes (hydrolases) on the one hand, such as a- amylases 
(endomylases), (3-amylases (exomylases), y-amylases (amyloglucosidases), D-enzymes 
(glucosyltransferases), R-enzymes (debranching enzymes), oc-glucosidases (maltases) 
and, on the other hand, the phosphorolytic enzymes (or starch phosphorylases). 

Several isoforms of starch synthases occur together in higher plants. The main 
difference between these isoforms relates to their solubility (i.e. they are dissolved in 
the plastid stroma in plants) or to the fact that they are bound to the starch granule. 

The starch synthases bound to the starch granule (or GBSS: Granule Bound Starch 
Synthases) occur in close association with starch. Several isoforms of GBSS have been 
isolated in maize, pea, potato or wheat (MacDonald and Preiss, 1985; Smith, 1990; Dry 
et al, 1992; Denyer et al, 1995). In all cases, GBSSI is the main isoform; the part 
played by this isoform in the biogenesis of the starch granule is the formation of 
amylose (Tsai, 1974; Hovenkamp-Hermelink et al, 1987; Delrue et al, 1992; Denyer et 
al, 1995). A mutation at the loci WXof cereals, AMF of the potato, and LAM of the pea, 
combines disappearance of GBSSI with complete collapse of the amylose fraction of 
starch. A cDNA corresponding to the "Waxy protein" (through a misuse of language, 
the term "Waxy protein" is employed to designate the GBSSI in plants, thus 
distinguishing it from other GBSS) has been isolated in wheat, barley, maize, rice, 
potato and pea. Comparisons of the relative protein sequences show there is 
considerable homology between the different species (Ainsworth et al, 1993). 
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GBSSI is not the only starch synthase bound to the starch granule. Other isoforms 
are found bound to the starch granule in pea, potato, maize or wheat {Smith, 1990; Dry 
et al, 1992; Mu et al, 1994; Denyer et al, 1995). However, the roles of these various 
isoforms are not yet clear. Furthermore, most of them are also found in the soluble 
5 phase. 

The soluble starch synthases (SS) are not bound to the starch granule, but are 
found in soluble form in the plastid stroma of plants. As with the bound forms, several 
forms of soluble starch synthases occur in the higher plants. For example, three 
isoforms of soluble starch synthases (SSI, SSII and SSIII) have been detected in the 
10 potato tuber. 

cDNA's corresponding to various forms of soluble starch synthases have been 
cloned in higher plants (Baba et al, 1993; Dry et al, 1992; Edwards et al, 1995; Abel 
O et al, 1996; Marshall et al, 1996; Gao et al y 1998). Sequence comparison flowing 

t ii from this clearly shows the presence of three regions that are highly conserved across 

!if 15 the isoforms, whether within a single species or between species of higher plants. 

SI Recent research by the Inventors made it possible to establish that soluble starch 

"r.£ 

\1 synthase II (SSII) from Chlamydomonas reinhardtii is mainly involved in the formation 

;L of crystals of the amylopectin molecule. 

1^ On the other hand, GBSSI does not take part in the construction of amylopectin 

J^; 20 crystals. GBSSI activity has never been detected in the soluble phase. GBSSI is 

\f t intimately associated with the starch granule. However, in contrast to amylase, no unit 

binding to starch has been found in the GBSSI sequences described so far. Accordingly, 
the mechanism controlling the binding of GBSSI to the starch granule is unknown. 

Starch synthases are of particular interest in that these enzymes might make it 
25 possible to transport a recombinant peptide of interest towards the plastids where the 

biosynthesis of starch granules takes place. Thus, the transformation of plants with 
sequences coding for fusion peptides between a starch synthase and a peptide of interest 
would make it possible to obtain starch granules in large quantity, from which the said 
peptide of interest could be recovered. 
30 It was with this objective that the authors of International Application WO 

98/14601 (Exseed Genetics) described nucleotide sequences coding for fusion proteins 
in which the polypeptide of interest is bound to the amino terminal end of a starch 
synthase selected from the group comprising soluble starch synthases I, II and III (SSI, 
SSII, SSIII), granule bound starch synthases (GBSS), branching enzymes I, Ha and IEBb 



and the glucoamylases. However, no method of transformation of plants by means of 
the sequences described in that application, and hence of obtaining starch granules 
transformed by the said sequences, is illustrated in detail. 

The present invention arises from the demonstration by the inventors that only the 
transformation of plants with nucleotide sequences coding for fusion polypeptides in 
which the polypeptide of interest is bound to the carboxy terminal end of the starch 
synthase makes it possible to obtain starch granules containing the said peptide of 
interest. 

One of the aims of the present invention is to provide novel nucleotide sequences 
coding for fusion proteins capable of transporting a peptide of interest towards the site 
of biosynthesis of the starch granules in plant cells (including the cells of algae or 
micro-algae). 

Another aim of the present invention is to provide plants that have been 
transformed by means of the aforementioned nucleotide sequences, the said plants 
producing starch containing a polypeptide of interest. 

Another aim of the present invention is to provide starch granules containing a 
polypeptide of interest. 

Another aim of the present invention is to provide a method of preparation of 
these starch granules. 

Another aim of the present invention is to provide a method of preparation of a 
recombinant polypeptide of interest starting from these starch granules. 

Another aim of the present invention is to provide compositions, especially 
pharmaceutical or for foodstuffs, containing the aforementioned starch granules. 

Another aim of the present invention is to provide a method of biotransformation 
of starch granules when the said peptide of interest that is used is capable of 
transforming starch. 

The invention will be illustrated below with the aid of the following diagrams: 

- Fig. 1: cDNA coding for the carboxy terminal part of the GBSSI of 
Chlamydomonas reinhardtii, 

- Fig. 2: cDNA coding for the GBSSI of Chlamydomonas reinhardtii, and 
peptide sequence of the GBSSI of Chlamydomonas reinhardtii. 

The present invention relates to any recombinant nucleotide sequence 
characterized in that it comprises, in the direction a nucleotide sequence coding 

for an adenosine diphosphate glucose a-l,4-glucan oc-4-glucosyltransferase or starch 



synthase (EC 2.4.1.21), or for a protein derived from this enzyme, especially by 
suppression, addition or substitution of one or more amino acids, the said starch 
synthase or derived protein having the property of migrating to the sites of biosynthesis 
of starch granules in plant cells and of attaching to the starch granules, the said 
nucleotide sequence coding for the enzyme or aforementioned protein being positioned 
upstream of a nucleotide sequence coding for a peptide or polypeptide of interest. 

By starch synthase we mean, in the foregoing and hereinafter, any protein having 
the property of migrating to the sites of biosynthesis of starch granules in plant cells and 
of attaching to the starch granules, whether or not this starch synthase has conserved its 
enzymatic activity within the fusion polypeptide, encoded by an aforementioned 
recombinant nucleotide sequence, between the said starch synthase and the said 
polypeptide of interest. 

Preferably, the nucleotide sequence coding for a starch synthase, or for a derived 
protein as defined above, is selected from those coding for a starch synthase bound to 
the starch granule GBSS that occurs in particular in plants, algae or micro-algae, and 
even more advantageously for an isoform GBSSI, or for a protein derived from this 
GBSS, or GBSSI, as defined above. 

The invention relates more particularly to any recombinant nucleotide sequence as 
defined above, characterized in that the nucleotide sequence coding for a starch 
synthase, and more particularly for a GBSS, and especially for a GBSSI, is such as is 
obtained by screening a cDNA library prepared from cells that are likely to contain this 
enzyme, especially from cells of plants, algae or micro-algae, by means of an antiserum 
containing antibodies specifically recognizing the said starch synthase coded by one or 
more cDNA in the library, when the said starch synthase is expressed by a suitable 
cloning vector, the said antiserum being obtained by immunization of an animal, such as 
a rabbit, with starch extracted from the aforementioned cells. 

The invention relates more particularly to any recombinant nucleotide sequence as 
defined above, characterized in that the nucleotide sequence coding for a starch 
synthase, or for a derived protein, is selected from: 

- the nucleotide sequence of the cDNA of about 2900 to 3100 base pairs, and of 
which the 1696 base pairs of the 3' end are shown in Fig. 1, the said nucleotide 
sequence: 

. coding for the GBSSI of Chlamydomonas reinhardtii of about 640 to 680 
amino acids, especially of about 660 amino acids, of which the amino terminal 
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end corresponds to the following succession of amino acids: 
ALDIVMVAAEVAPGGKTGGLGDV, or ALDIVMVAAEVAPWSKTGGLGD- 
V, and of which the carboxy terminal end corresponds to the succession of amino 
acids shown in Fig. 1, 

. and being obtained by screening a cDNA library prepared from cells of 
Chlamydomonas reinhardtii, by means of an antiserum obtained by immunization 
of rabbits with the starch extracted from the aforementioned cells of 
Chlamydomonas reinhardtii, 

- or a nucleotide fragment of the aforementioned cDNA, coding for a peptide 
fragment of the GBSSI of Chlamydomonas reinhardtii, the said peptide fragment 
comprising the whole of the amino terminal part of the said GBSSI, and being delimited 
at its carboxy terminal end by the amino acid located in one of the positions 25 to 23 8, 
or in one of the positions 1 18 to 23 8 5 of the amino acid sequence shown in Fig. 1, 

- or a nucleotide sequence derived by degeneration of the genetic code of the 
nucleotide sequence of the aforementioned cDNA, or of an aforementioned nucleotide 
fragment of the latter, and coding for the aforementioned GBSSI of Chlamydomonas 
reinhardtii, or for an aforementioned peptide fragment of the latter, 

- or a nucleotide sequence derived from an aforementioned nucleotide sequence 
or fragment, especially by substitution, suppression or addition of one or more 
nucleotides, and encoding a peptide sequence derived from the aforementioned GBSSI 
of Chlamydomonas reinhardtii, or derived from an aforementioned peptide fragment of 
the latter, and having the property of attaching to the starch granules, the said derived 
nucleotide sequence preferably having a homology of at least about 50%, and preferably 
of at least about 70%, with the aforementioned nucleotide sequence or fragment, 

- or a nucleotide sequence capable of hybridization with one of the 
aforementioned nucleotide sequences or fragments, especially in the strict conditions of 
hybridization defined later, 

the property possessed by a starch synthase, or a fragment or a protein derived 
from the latter as defined above, of being able to attach to the starch granules, being 
measurable by the following technique: extraction of the proteins from the starch 
granules, for example according to the method described in detail below, and detection 
of the presence of the said starch synthase, or of a fragment or of a protein derived from 
the latter as defined above, especially by polyacrylamide gel electrophoresis according 
to the technique described in detail later. 
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The invention relates more particularly to any recombinant nucleotide sequence as 
defined above, characterized in that the aforementioned nucleotide sequence coding for 
a starch synthase, or for a derived protein, is more particularly selected from: 

- the nucleotide sequence of cDNA shown in Fig. 2, corresponding to SEQ ID 
NO : 1 in the sequence list given later, the said nucleotide sequence coding for the 
GBSSI of Chlamydomonas reinhardtii, 

- any fragment as defined above of the nucleotide sequence SEQ ID NO : 1 
shown in Fig. 2, and more particularly any sequence of which the nucleotide of the 5' 
end corresponds to that located in one of the positions 1 to 186 of SEQ ID NO : 1, and 
of which the nucleotide of the 3 1 end corresponds to that located in one of the positions 
1499 to 3117 of SEQ ID NO : 1, especially: 

. the sequence SEQ ID NO : 2 delimited by the nucleotides located in 
positions 15 to 2138 of SEQ ID NO : 1, coding for the GBSSI of Chlamydomonas 
reinhardtii in the form of pre-protein of 708 amino acids (SEQ ID NO : 3) 
delimited by the amino acids located at positions 1 and 708 of the peptide 
sequence shown in Fig. 2, 

. the sequence SEQ ID NO : 4 delimited by the nucleotides located at 
positions 186 to 2138 of SEQ ID NO : 1, coding for the GBSSI of 
Chlamydomonas reinhardtii in the form of a mature protein of 651 amino acids 
(SEQ ID NO : 5) delimited by the amino acids located at positions 58 and 708 of 
the peptide sequence shown in Fig. 2, 

. the sequence SEQ ID NO : 6 delimited by the nucleotides located at 
positions 186 to 1499 of SEQ ID NO : 1, coding for a fragment of 438 amino 
acids (SEQ ID NO : 7) delimited by the amino acids located at positions 58 and 
495 of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

. the sequence SEQ ID NO : 8 delimited by the nucleotides located at 
positions 186 to 1778 of SEQ ID NO : 1, coding for a fragment of 531 amino 
acids (SEQ ID NO : 9) delimited by the amino acids located at positions 58 and 
588 of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

- or a nucleotide sequence derived by degeneration of the genetic code of the 
aforementioned nucleotide sequences, and coding for the aforementioned GBSSI of 
Chlamydomonas reinhardtii, or for an aforementioned peptide fragment of the latter, 



- or a nucleotide sequence derived from an aforementioned nucleotide sequence 
or fragment, especially by substitution, suppression or addition of one or more 
nucleotides, and encoding a peptide sequence derived from the aforementioned GBSSI 
of Chlamydomonas reinhardtii, or derived from an aforementioned peptide fragment of 
the latter, and having the property of attaching to the starch granules, the said derived 
nucleotide sequence preferably having a homology of at least about 50%, and preferably 
of at least about 70%, with the aforementioned nucleotide sequence or fragment, 

- or a nucleotide sequence capable of hybridizing with one of the 
aforementioned nucleotide sequences or fragments, especially in the strict conditions of 
hybridization defined above. 

The invention relates more particularly to any recombinant nucleotide sequence as 
defined above, characterized in that the nucleotide sequence coding for a peptide or 
polypeptide of interest is selected from those encoding biologically active peptides, 
especially peptides of therapeutic interest or that can be used in the agricultural and 
food industry. 

The invention also relates to any recombinant nucleotide sequence as defined 
above, characterized in that the nucleotide sequence coding for a peptide or polypeptide 
of interest is selected from those encoding enzymes that are able to transfoim starch, 
such as the enzymes that interact with the a-glucans including various hydrolases, 
phosphorylases, a- 1,4 glucanotransferases, branching enzymes, amylases, and 
especially the heat-resistant hydrolases obtained from extremophiles such as the 
archaebacteria that are active at temperatures above 40°C. 

The invention also relates to any recombinant nucleotide sequence as defined 
above, characterized in that it comprises a nucleotide sequence coding for a cleavage 
site, the said nucleotide sequence being positioned between the nucleotide sequence 
coding for a starch synthase, or a protein derived from the latter, and the nucleotide 
sequence encoding the polypeptide of interest. 

As an illustration, the nucleotide sequence coding for a cleavage site is selected 
from the sequences coding for a peptide sequence of the aspartyl-proline type, which is 
very unstable at acid pH, or coding for a small peptide sequence recognized specifically 
by a protease, such as trypsin, chymotrypsin, pepsin, collagenase, thrombin, 
alasubtilisin, or recognized by chemical compounds such as cyanogen bromide. 

The invention also relates to any recombinant nucleotide sequence as defined 
above, characterized in that it comprises a promoter located upstream of the nucleotide 



sequence coding for a starch synthase, or a protein derived from the latter, as well as a 
sequence coding for transcription termination signals located downstream of the 
nucleotide sequence encoding the polypeptide of interest. 

Among the transcription promoters suitable for use within the scope of the present 
invention, we may mention: 

- for prokaryotic promoters, the Lac or T7 promoters, 

- for the eukaryotic promoters of higher plant type, the promoter 35S CaMV, or 
any type of promoter of plant origin, 

- in the "case of the transformation of micro-algae, the promoter used can be that 
of the ARG7 gene encoding arginosuccinate lyase or the promoter of the NIT1 gene 
encoding nitrate reductase. 

The invention also relates to any recombinant vector, especially of the plasmid, 
cosmid or phage type, characterized in that it contains a recombinant nucleotide 
sequence according to the invention as defined above, inserted in a site that is non- 
essential for its replication. 

The invention also relates to any cellular host, transformed by a recombinant 
vector as defined above, especially any bacterium such as Agrobacterium turnefaciens, 
and comprising at least one recombinant nucleotide sequence according to the 
invention. 

The invention also relates to any fusion polypeptide characterized in that it 
comprises: 

- in the N-terminal position, a starch synthase, or a protein derived from that 
enzyme, especially by suppression, addition or substitution of one or more amino acids, 
the said starch synthase or derived protein having the property of migrating to the sites 
of biosynthesis of the starch granules in plant cells and of attaching to the starch 
granules, 

- and, in the C-terminal position, a peptide or polypeptide of interest, 

the C-terminal part of the amino acid sequence of the starch synthase, or of the 
derived protein, being thus bound to the N-terminal part of the peptide sequence of 
interest, the said fusion polypeptide being encoded by a recombinant nucleotide 
sequence as defined above according to the invention. 

The invention relates more particularly to any fusion polypeptide as defined 
above, characterized in that it includes, in the N-terminal position, a GBSS that occurs 
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in particular in plants, algae or micro-algae, and more particularly an isoform GBSSI, or 
a protein derived from the latter as defined above. 

The invention relates more particularly to any fusion polypeptide as defined 
above, characterized in that the starch synthase is selected from: 

- the GBSSI of Chlamydomonas reinhardtii of about 640 to 680 amino acids, of 
which the amino terminal end corresponds to the following succession of amino acids: 
ALDIVMVAAEVAPGGKTGGLGDV, or ALDIVMVAAEVAPWSKTGGLGDV, and 
the carboxy terminal end corresponds to the succession of amino acids shown in Fig. 1, 
the said GBSSI being encoded by the nucleotide sequence obtained by screening a 
cDNA library prepared from cells of Chlamydomonas reinhardtii, by means of an 
antiserum obtained by immunization of rabbits with the starch extracted from the 
aforementioned cells of Chlamydomonas reinhardtii, 

- or a peptide fragment of the GBSSI of Chlamydomonas reinhardtii, the said 
peptide fragment comprising the whole of the amino terminal part of the said GBSSI, 
and being delimited at its carboxy terminal end by the amino acid located in one of the 
positions 25 to 238, or in one of the positions 118 to 238, of the amino acid sequence 
shown in Fig. 1 , 

- or a peptide sequence derived from an aforementioned peptide sequence or 
fragment, especially by substitution, suppression or addition of one or more amino 
acids, and having the property of attaching to the starch granules, the said derived 
peptide sequence preferably having a homology of at least about 60%, and 
advantageously of at least about 80%, with the aforementioned peptide sequence or 
fragment, 

the property possessed by the GBSSI of Chlamydomonas reinhardtii, or a 
fragment or a protein derived from the latter as defined above, of being able to attach to 
the starch granules, being measurable by the technique described above. 

The invention relates more particularly to any fusion polypeptide as defined 
above, characterized in that the starch synthase defined above is selected more 
particularly from: 

- the peptide sequence SEQ ID NO : 3 delimited by the amino acids located at 
positions 1 to 708 in Fig. 2, corresponding to the GBSSI of Chlamydomonas reinhardtii 
in the form of a pre-protein of 708 amino acids, 

- any fragment as defined above from the peptide sequence SEQ ID NO : 3 
shown in Fig. 2, and more particularly any sequence of which the amino acid of the 
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amino terminal end corresponds to that located in one of the positions 1 to 58 of SEQ 
ID NO : 3, and of which the amino acid of the carboxy terminal end corresponds to that 
located in one of the positions 495 to 708 of SEQ ID NO : 3, especially: 

. the sequence SEQ ID NO : 5 delimited by the amino acids located at 
positions 58 to 708 of SEQ ID NO : 3, corresponding to the GBSSI of 
Chlamydomonas reinhardtii in the form of a mature protein of 65 1 amino acids, 

. the sequence SEQ ID NO : 7 delimited by the amino acids located at 
positions 58 to 495 of SEQ ID NO : 3, corresponding to a fragment of 438 amino 
acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

. the sequence SEQ ID NO : 9 delimited by the amino acids located at 
positions 58 to 588 of SEQ ID NO : 3, corresponding to a fragment of 53 1 amino 
acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

- or a peptide sequence derived from an aforementioned peptide sequence or 
fragment, especially by substitution, suppression or addition of one or more amino 
acids, and having the property of attaching to the starch granules, the said derived 
peptide sequence preferably having a homology of at least about 60%, and 
advantageously of at least about 80%, with the aforementioned peptide sequence or 
fragment, 

the property possessed by the GBSSI of Chlamydomonas reinhardtii, or a 
fragment or a protein derived from the latter as defined above, of being able to attach to 
the starch granules, being measurable by the technique described above. 

The invention relates more particularly to any fusion polypeptide as defined 
above, characterized in that the polypeptide of interest is selected from the biologically 
active peptides, especially the peptides of therapeutic interest or that can be used in the 
agricultural and food industry. 

The invention also relates to any fusion polypeptide as defined above, 
characterized in that the polypeptide of interest is selected from the enzymes that are 
able to transform starch, such as the enzymes that interact with the oc-glucans, including 
various hydrolases, phosphorylases, a-l,4-glucanotransferases, branching enzymes, 
amylases, and especially the heat-resistant hydrolases obtained from extremophiles such 
as the archaebacteria that are active at temperatures above 40°C. 
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The invention also relates to any fusion polypeptide as defined above, 
characterized in that it contains a cleavage site, as described above, positioned between 
on the one hand the starch synthase or a protein derived from the latter, and on the other 
hand the polypeptide of interest. 

The invention also relates to genetically transformed plant cells, containing one or 
more recombinant nucleotide sequences as described above, integrated in their genome 
or maintained in a stable manner in their cytoplasm, the said plant cells being selected 
from the cells of plants, algae or micro-algae, capable of producing starch. 

The invention also relates to transgenic plant cells as described above containing 
one or more fusion polypeptides defined above within the starch granules contained in 
the said plant cells. 

The invention relates more particularly to the aforementioned transgenic plant 
cells, transformed with a recombinant nucleotide sequence containing the nucleotide 
sequence of cDNA of about 2900 to 3100 base pairs, and of which the 1696 base pairs 
of the 3 1 end are shown in Fig. 1, the said nucleotide sequence coding for the GBSSI of 
Chlamydomonas reinhardtii described above, or containing a fragment or a derived 
sequence as described above from the aforementioned cDNA. 

The invention relates even more particularly to the aforementioned transgenic 
plant cells, transformed with: 

- the cDNA nucleotide sequence shown in Fig. 2, the said nucleotide sequence 
coding for the GBSSI of Chlamydomonas reinhardtii, 

- any fragment as defined above of the nucleotide sequence shown in Fig. 2, 

- or a derived nucleotide sequence, as defined above, from the aforementioned 
nucleotide sequences, 

- or a nucleotide sequence capable of hybridization with one of the 
aforementioned nucleotide sequences or fragments, especially in the strict conditions of 
hybridization defined above. 

The invention also relates to genetically transformed plants, algae or micro-algae, 
or parts, especially flowers, fruits, leaves, stems, roots, seeds, or fragments of these 
plants, algae or micro-algae, containing at least one recombinant nucleotide sequence as 
defined above integrated in the genome or maintained in a stable manner in the 
cytoplasm of the cells of which they are composed. 

The invention also relates to genetically transformed plants, algae or micro-algae, 
or parts, or fragments of these plants, algae or micro-algae, as defined above, containing 
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one or more fusion polypeptides as described above within the starch granules 
contained in the plant cells of which they are composed. 

Among the plants, algae or micro-algae transformed within the scope of the 
present invention, we may mainly mention wheat, maize, potato, rice, barley, amaranth, 

5 algae of the genus Chlamydomonas, especially Chlamydomonas reinhardtii, algae of 

the genus Chlorella, especially Chlorella vulgaris, or single-celled algae of the genus 
Dunaliella (as described in the work "Dunaliella: Physiology, Biochemistry, and 
Biotechnology (1992), Mordhay Avron and Ami Ben-Amotz Publishers, CRC Press 
Inc., Boca Raton, Florida, USA"). 

10 The invention relates more particularly to the aforementioned transgenic plants, 

algae or micro-algae, transformed with a recombinant nucleotide sequence containing 
the cDNA nucleotide sequence of about 2900 to 3100 base pairs and of which/the 1696 
base pairs of the 3' end are shown in Fig. 1, the said nucleotide sequence coding for the 
GBSSI of Chlamydomonas reinhardtii described above, or containing a fragment or a 

15 derived sequence such as are described above of the aforementioned cDNA. 

The invention relates even more particularly to the aforementioned transgenic 
plants, algae or micro-algae, transformed with: 

- the cDNA nucleotide sequence shown in Fig. 2, the said nucleotide sequence 
coding for the GBSSI of Chlamydomonas reinhardtii, 

20 - any fragment as defined above of the nucleotide sequence shown in Fig. 2, 

- or a derived nucleotide sequence, as defined above, of the aforementioned 
nucleotide sequences, 

- or a nucleotide sequence capable of hybridizing with one of the 
aforementioned nucleotide sequences or fragments, especially in the strict conditions of 

25 hybridization defined above. 

The invention also relates to starch granules characterized in that they include one 
or more fusion polypeptides defined above, the said starch granules being further 
designated by the expression "transformed starch granules" or "glucosomes". 

The invention relates more particularly to the aforementioned starch granules 

30 comprising a fusion polypeptide defined above, the said fusion polypeptide containing 

the GBSSI of Chlamydomonas reinhardtii of about 640 to 680 amino acids described 
above, the amino terminal end of which corresponds to the following succession of 
amino acids: ALDIVMVAAEVAPGGKTGGLGDV, or ALDIVM VAAEVAPWSKTG- 
GLGDV, and the carboxy terminal end corresponds to the succession of amino acids 
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shown in Fig. 1, or a fragment or a derived polypeptide such as are described above of 
the GBSSI of Chlamydomonas reinhardtii. 

The invention relates more particularly to the aforementioned starch granules 
comprising a fusion polypeptide defined above, the said fusion polypeptide containing 

5 the sequence delimited by the amino acids located at positions 1 to 708 in Fig. 2, coding 

for the GBSSI of Chlamydomonas reinhardtii in the form of a pre-protein of 708 amino 
acids, or any fragment as defined above of the peptide sequence shown in Fig. 2, 
especially any sequence in which the amino acid of the amino terminal end corresponds 
to that located in one of the positions 1 to 58 in Fig. 2, and in which the amino acid of 

10 the carboxy terminal end corresponds to that located in one of the positions 495 to 708 

in Fig. 2, such as the fragments mentioned above. 

Advantageously, the aforementioned starch granules are characterized in that they 
have a diameter between about 0.1 \im and several tens of jam, and in that the 
proportion by weight of the fusion polypeptides in these granules is between about 0.1% 

15 and 1%. 

The invention also relates to any pharmaceutical composition characterized in that 
it includes transformed starch granules as defined above, if necessary in combination 
with a physiologically acceptable vehicle, the said granules containing one or more 
fusion polypeptides as defined above, the peptide of interest in the said fusion 
20 polypeptides possessing a defined therapeutic effect. 

Advantageously the aforementioned pharmaceutical compositions of the invention 
are in a form that can be administered parenterally, especially intravenously, or in a 
form that can be administered orally. 

Preferably, the aforementioned pharmaceutical compositions that can be 
25 administered parenterally are characterized in that the diameter of the starch granules is 

between about 0.1 \xm and several jam, especially between about 0.1 \xm and 10 \xm, 
and in that the proportion by weight of the fusion polypeptides in these granules is 
between about 0.1% and 1%. 

Starch granules as described above, with small diameters between about 0.1 \xm 
30 and about 10 jam, in which the proportion by weight of fusion polypeptides is between 

about 0.1% and 1%, are obtained advantageously: 



15 



- from plants or cells of plants transformed within the scope of the present 
invention and selected for their property of producing the aforementioned starch 
granules naturally, the said plants being selected in particular from rice and amaranth, 

- or from parts of transformed plants within the scope of the present invention 
the said parts of these plants producing the aforementioned starch granules naturally, 
such as the leaves of the plants, 

- or from plants or cells of plants transformed within the scope of the present 
invention, these plants being selected from plants that have mutations such that they 
produce starch- granules of small diameters as mentioned above, especially from the 
mutant plants described in Buleon A. et al., 1998, 

- or from plants or cells of plants transformed within the scope of the present 
invention, these plants being selected from the plants transformed with the aid of 
antisense nucleotide sequences of all or part of the gene coding for ADP -glucose 
pyrophosphorylase required for the synthesis of ADP-glucose in plant cells, and 
especially from the transformed plants described in the article by Miiller-Rober B. et al., 
1992. 

Advantageously, in the case of pharmaceutical compositions mentioned above 
that can be administered parenterally, the starch granules are preferably selected from 
those of amorphous structure in the case when we wish to obtain rapid release of the 
fusion polypeptide that they contain in the patient's blood, or conversely, from those of 
crystalline structure when we wish to release the fusion polypeptide progressively in the 
blood. 

By way of illustration, amorphous starch granules can be obtained from seeds 
transformed according to the invention at the germination stage, or from specific mutant 
plants such as described by Shannon J. and Garwood D., 1984, especially from the 
mutant cultivars such as "amylose extender" of maize or indeed all mutant cultivars of 
plants, algae or micro-algae whose starch is amylose-enriched. 

The starch granules according to the invention of crystalline structure, 
advantageously have about 30 to 35% of crystals, and can be obtained from seeds of 
plants, especially of cereals, that have just been harvested and at maturity, or from 
mutant plants such as described by Shannon J. and Garwood D., 1984, especially from 
the mutant cultivars such as "waxy" of maize or indeed all the mutant cultivars of 
plants, algae or micro-algae whose starch is devoid of amylose. 



16 



The invention also relates to any pharmaceutical composition characterized in that 
it includes one or more fusion polypeptides as defined above, if necessary in 
combination with a physiologically acceptable vehicle, the peptide of interest in the said 
fusion polypeptides possessing a defined therapeutic effect. 

The invention also relates to any food composition as described above, 
characterized in that it contains transformed starch granules as defined above, the said 
granules containing one or more fusion polypeptides as defined above, the peptide of 
interest in the said fusion polypeptides being usable in the food-processing field. 

The invention also relates to any food composition as described above, 
characterized in that it contains one or more fusion polypeptides as defined above, the 
peptide of interest in the said fusion polypeptides being usable in the food-processing 
field 

The present invention also relates to any method of obtaining plant cells (from 
plants, algae or micro-algae), and, if necessary, from whole plants, algae or micro-algae, 
transformed by at least one nucleotide sequence as defined above, characterized in that 
it comprises: 

- the transformation of plant cells, in such a way as to integrate in the genome of 
these cells, or maintain in a stable manner in their cytoplasm, one or more recombinant 
nucleotide sequences according to the invention, and cultivation of these transformed 
cells in vitro, 

- if necessary, the production of transformed plants from the aforementioned 
transformed cells. 

According to one embodiment of the aforementioned method of the invention, the 
transformation of plant cells can be carried out by transfer of the recombinant 
nucleotide sequence of the invention in the protoplasts, especially after incubation of the 
latter in a solution of polyethylene glycol (PEG) in the presence of divalent cations 
(Ca 2+ ) according to the method described in the article by Krens et ah, 1982. 

Transformation of the plant cells can also be carried out by electroporation 
especially according to the method described in the article by Fromm et al. y 1986. 

Transformation of the plant cells can also be carried out using a gene gun, by 
means of which metal particles coated with recombinant nucleotide sequences 
according to the invention are propelled at high velocity, thus delivering genes to the 
interior of the cell nucleus, especially in accordance with the technique described in the 
article by Sanford, 1988. 
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Another method of transformation of plant cells is the method of cytoplasmic or 
nuclear micro-injection as described in the article by De La Penna et al. 9 1987. 

According to a particularly preferred embodiment of the aforementioned method 
of the invention, the plant cells are transformed by putting the latter in the presence of a 
cellular host transformed by a vector according to the invention, as described above, the 
said cellular host being able to infect the said plant cells making it possible to integrate 
in the genome or maintain in a stable manner in the cytoplasm of the latter, recombinant 
nucleotide sequences of the invention initially contained in the genome of the 
aforementioned vector. 

Advantageously, the aforementioned cellular host employed is Agrobacterium 
tumefaciens, especially according to the methods described in the articles of Bevan, 
1984 and of An et al., 1986, or Agrobacterium rhizogenes, especially according to the 
method described in the article by Jouanin et al. 9 1987, 

Among the plant cells capable of being transformed within the scope of the 
present invention, we may mention mainly the cells of wheat, maize, potato, rice, 
barley, amaranth, Chlamydomonas reinhardtii, Chlorella vulgaris. 

According to one embodiment of the aforementioned method of the invention, the 
plant cells transformed according to the invention are cultivated in vitro, especially in 
bioreactors according to the method described in the article by Brodelius, 1988, in a 
liquid medium, or according to the method described in the article by Brodelius et aL 9 
1979, in immobilized form, or according to the method described in the article by Deno 
et aL, 1987, by culture of roots transformed in vitro. 

According to a preferred embodiment of the aforementioned method of the 
invention, the transformation of plant cells is followed by a stage of obtaining 
transformed plants by culturing the said transformed cells in a suitable medium, and, if 
necessary, fertilization and recovery of the seeds of the plants obtained in the preceding 
stage, and cultivation of these seeds to obtain plants of the next generation. 

The seeds transformed according to the invention are harvested from the 
aforementioned transformed plants, these plants being either those of the TO generation, 
i.e. those obtained from culture of transformed cells of the invention on a suitable 
medium, or advantageously those of the next generations (Tl, T2 etc.) obtained by self- 
fertilization of the plants of the preceding generation and in which the recombinant 
nucleotide sequences of the invention are reproduced in accordance with Mendel's laws, 
or the laws of extrachromosomal inheritance. 
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The invention also relates to a method of preparation of transformed starch 
granules as described above, characterized in that it comprises a stage of extraction of 
the starch granules from transformed plant cells or from plants, or from parts, especially 
flowers, fruits, leaves, stems, roots, or from fragments of these plants, transformed as 
5 mentioned above, especially by sedimentation in the conditions described later. 

Preferably, the starch granules according to the invention are those obtained by 
extraction from transformed plants, algae or micro-algae, described above, or from 
parts, or fragments of these plants, algae or micro-algae, defined above, especially by 
sedimentation -in the conditions described later, 
10 The transformed plants used for recovering the starch granules are those of the TO 

generation, or advantageously those of the next generations (Tl, T2 etc.) mentioned 
above. 

The invention also relates to a method of preparation of fusion polypeptides as 
defined above, characterized in that it comprises a stage of recovery, and if necessary of 

15 purification, of the fusion polypeptides from the aforementioned transformed starch 

granules especially in the conditions described later. 

The invention also relates to a method of preparation of a peptide of interest, 
characterized in that it comprises the implementation of a method as described above 
for obtaining plant cells or transformed plants according to the invention, the said 

20 method being carried out by transformation of plant cells with the aforementioned 

nucleotide sequences coding for a fusion polypeptide containing a cleavage site as 
described above, and includes an additional stage of cleavage of the said fusion 
polypeptide, by means of a suitable reagent, then, if necessary, a stage of purification of 
the polypeptide of interest. 

25 The invention also relates to a method of biotransformation of starch granules, 

characterized in that it comprises the following stages: 

- transformation of plant cells as defined above with the aid of host cells 
described above containing one or more nucleotide sequences coding for enzymes 
capable of transforming starch as mentioned above, 

30 - production of plants, algae or micro-algae transformed in such a way that their 

genome contains one or more nucleotide sequences described above, by culture in vitro 
of the aforementioned transformed plant cells, 

- if necessary, fertilization and recovery of the seeds of the plants obtained in the 
preceding stage, and culture of these seeds to obtain plants of the next generation, 
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- extraction of starch granules from the aforementioned transformed plants, algae 
or micro-algae, or from parts, especially flowers, fruits, leaves, stems, roots, or from 
fragments of these plants, algae or micro-algae, especially by sedimentation in the 
conditions described later, 

- if necessary, heating of the said starch granules to a temperature at which the 
peptide of interest of the aforementioned fusion polypeptide is capable of being active. 

Preferably, when the methods described above are carried out by transformation 
of plant cells, the latter are transformed with the aforementioned recombinant sequences 
containing the : -cDNA nucleotide sequence of about 2900 to 3100 base pairs, and of 
which the 1696 base pairs of the 3' end are shown in Fig. 1, the said nucleotide sequence 
coding for the GBSSI of Chlamydomonas reinhardtii described above, and more 
particularly with recombinant sequences as mentioned above containing the nucleotide 
sequence shown in Fig. 2, or containing a fragment or a derived sequence as described 
above of the nucleotide sequence shown in Fig. 2. The use of these recombinant 
sequences containing the nucleotide sequence coding for the GBSSI of Chlamydomonas 
reinhardtii described above makes it possible advantageously to avoid the development 
of effects of co-suppression in the transformed plants thus obtained. 

The invention also relates to a method of preparation of antibodies specifically 
recognizing a starch synthase bound to the starch granule, of a given plant, algae or 
micro-algae, by immunization of an animal, especially of a rabbit, with the starch 
obtained from the said plant, algae or micro-algae. 

Therefore, the invention relates more particularly to a method of preparation of 
antibodies specifically recognizing the GBSSI, of a given plant, algae or micro-algae, 
by immunization of an animal, especially of a rabbit, with the. starch obtained from the 
said plant, algae or micro-algae. 

The invention also relates more particularly to a method of preparation of 
antibodies specifically recognizing an isoform of GBSS other than GBSSI, from a given 
plant, algae or micro-algae, by immunization of an animal, especially a rabbit, with the 
starch obtained from the said plant, algae or micro-algae having a mutation such that the 
expression of the GBSSI is suppressed, for example a mutation selected from the 
following: sta2-29::ARG7 in Chlamydomonas reinhardtii (described by Delrue et al, 
1992, mentioned above), amf in the potato (described by Hovenkamp-Hermelink et al., 
1987, mentioned above), wx in maize, rice and wheat (described by Tsai, 1974, 
mentioned above), lam in the pea (described by Denyer et al, 1995, mentioned above). 
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The invention also relates to a method of obtaining starch synthase, such as 
GBSS, and more particularly for the isoform GBSSI, from a given plant, algae or micro- 
algae, by screening a cDNA library prepared from cells of the said given plant, algae or 
micro-algae, capable of containing this enzyme, using an antiserum" containing 
antibodies specifically recognizing the said enzyme encoded by one or more cDNA's 
from the library, when the said enzyme is expressed by a suitable cloning vector, the 
said antiserum being obtained according to the method mentioned above. 

The invention also relates to the nucleotide sequences coding for a starch synthase 
or for a derived protein, selected from: 

- the cDNA nucleotide sequence shown in Fig. 2, corresponding to SEQ ID 
NO : 1 in the sequence list given later, the said nucleotide sequence coding for the 
GBSSI of Chlamydomonas reinhardtii, 

- any fragment as defined above of the nucleotide sequence SEQ ID NO : 1 
shown in Fig. 2, and more particularly any sequence whose nucleotide of the 5' end 
corresponds to that located in one of the positions 1 to 186 of SEQ ID NO : 1, and 
whose nucleotide of the 3* end corresponds to that located in one of the positions 1499 
to 31 17 of SEQ ID NO : 1, especially: 

the sequence SEQ ID NO : 2 delimited by the nucleotides located at 
positions 15 to 2138 of SEQ ID NO : 1, coding for the GBSSI of Chlamydomonas 
reinhardtii in the form of pre-protein of 708 amino acids (SEQ ID NO : 3) 
delimited by the amino acids located at positions 1 and 708 of the peptide 
sequence shown in Fig. 2, 

the sequence SEQ ID NO : 4 delimited by the nucleotides located at 
positions 186 to 2138 of SEQ ID NO : 1, coding for the GBSSI of 
Chlamydomonas reinhardtii in the form of mature protein of 651 amino acids 
SEQ ID NO : 5) delimited by the amino acids located at positions 58 and 708 of 
the peptide sequence shown in Fig. 2, 

the sequence SEQ ID NO : 6 delimited by the nucleotides located at 
positions 186 to 1499 of SEQ ID NO : 1, coding for a fragment of 438 amino 
acids (SEQ ID NO : 7) delimited by the amino acids located at positions 58 and 
495 of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

the sequence SEQ ID NO : 8 delimited by the nucleotides located at 
positions 186 to 1778 of SEQ ID NO : 1, coding for a fragment of 531 amino 
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acids (SEQ ID NO : 9) delimited by the amino acids located at positions 58 and 
588 of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

- or a nucleotide sequence derived by degeneration of the genetic code of the 
aforementioned nucleotide sequences, and coding for the aforementioned GBSSI of 
Chlamydomonas reinhardtii, or for an aforementioned peptide fragment of the latter, 

- or a nucleotide sequence derived from an aforementioned nucleotide sequence 
or fragment, especially by the substitution, suppression or addition of one or more 
nucleotides, and encoding a peptide sequence derived from the aforementioned GBSSI 
of Chlamydomonas reinhardtii, or derived from an aforementioned peptide fragment of 
the latter, and having the property of attaching to the starch granules, the said derived 
nucleotide sequence preferably having a homology of at least about 50%, and preferably 
of at least about 70%, with the aforementioned nucleotide sequence or fragment, 

- or a nucleotide sequence capable of hybridizing with one of the 
aforementioned nucleotide sequences or fragments, especially in the strict conditions of 
hybridization defined above. 

The invention also relates to the polypeptides selected from: 

- the peptide sequence SEQ ID NO : 3 delimited by the amino acids located at 
positions 1 to 708 in Fig. 2, corresponding to the GBSSI of Chlamydomonas reinhardtii 
in the form of pre-protein of 708 amino acids, 

- any fragment as defined above of the peptide sequence SEQ ID NO : 3 shown 
in Fig. 2, and more particularly any sequence whose amino acid of the amino terminal 
end corresponds to that located in one of the positions 1 to 58 of SEQ ID NO : 3, and 
whose amino acid of the carboxy terminal end corresponds to, that located in one of the 
positions 495 to 708 of SEQ ID NO : 3, especially: 

the sequence SEQ ID NO : 5 delimited by the amino acids located at 
positions 58 to 708 of SEQ ID NO : 3, corresponding to the GBSSI of 
Chlamydomonas reinhardtii in the form of mature protein of 651 amino acids, 

the sequence SEQ ID NO : 7 delimited by the amino acids located at 
positions 58 to 495 of SEQ ID NO : 3, corresponding to a fragment of 438 amino 
acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

the sequence SEQ ID NO : 9 dehmited by the amino acids located at 
positions 58 to 588 of SEQ ID NO : 3, corresponding to a fragment of 531 amino 
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acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii shown 
in Fig. 2, 

- or a peptide sequence derived from an aforementioned sequence or peptide 
fragment, especially by substitution, suppression or addition of one or more amino 
acids, and having the property of attaching to the starch granules, the said derived 
peptide sequence preferably having a homology of at least about 60%, advantageously 
at least about 80%, with the aforementioned peptide sequence or fragment, 

the property possessed by the GBSSI of Chlamydomonas reinhardtii, or a 
fragment or a protein derived from the latter as defined above, of being able to attach to 
the starch granules, being measurable by the method described above. 

The invention also relates to polyclonal or monoclonal antibodies, directed against 
the aforementioned polypeptides. 

The invention will be further illustrated by means of the following detailed 
description of cloning of the gene coding for the GBSSI of Chlamydomonas reinhardtii, 
and obtaining transformed starch granules containing a fusion polypeptide with the said 
GBSSI, as well as by means of Fig. 1 showing the nucleotide sequence and the protein 
sequence deduced from the cDNA insert of the CD142 clone coding for the GBSSI of 
Chlamydomonas reinhardtii (the underlined sequence corresponds to one of the three 
regions that are highly conserved across all the starch and glycogen synthases and is 
probably involved in fixation of the ADP-glucose substrate). 

I) Cloning of the cDNA (complementary DNA) and gDNA (genomic DNA) 
sequences corresponding to the structural gene of the GBSSI of Chlamydomonas 
reinhardtii. 



A) Cloning of the cDNA 

The strategy developed for cloning the cDNA corresponding to the structural gene 
of the GBSSI of Chlamydomonas reinhardtii makes use of screening of an expression 
library using a polyclonal antiserum. The antiserum is able to recognize a polypeptide 
sequence encoded by a cDNA expressed from a suitable cloning vector. 

a) Production of the antiserum 
In order to produce an antiserum^ capable of specifically recognizing the GBSSI of 
C reinhardtii, the starch obtained from the wild strain (137C) was injected on three 
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occasions into an albino New Zealand hybrid rabbit. In a similar experiment, the 
residual starch from a double mutant strain at loci STA2 and STA3 (IJ2) was injected in 
the rabbit in the same conditions. 



Detailed protocols: 

- Genotypes of the strains of C. reinhardtii : 
137C : mt-nitl nit2 

IJ2 : mt-nitl nit2 sta2-29::ARG7 sta3-l 
The 137C strain is the reference strain for all the studies of starch metabolism 
carried out in C. reinhardtii. The IJ2 strain was fully described by Maddelein et al. in 
1994. In this double mutant strain at the STA2 and STA3 loci, the GBSSI and SSII 
activities are absent simultaneously. The mutation at the STA2 locus was generated by 
gene interruption by means of the pARG7 plasmid (Maddelein et al., 1994) and leads to 
complete disappearance of the GBSSI from the starch granule, whereas the mutant 
allele of the STA3 gene was generated by mutagenesis by X-rays (Fontaine et al., 1993). 

_ Conditions for culture, extraction and purification of the starch : the cells were 
cultured for 3 days in the TAP medium with continuous illumination (3000 lux) from an 
inoculum of 5x1 0 4 cells/ml. The main culture is stopped when the cell concentration 
reaches about 2xl0 6 cells/ml. 

Composition of the TAP medium (values for one litre of medium): 

NH 4 C1 0,40 g ZnS0 4 .7H 2 0 22 mg 

Tris 2,40 g H3BO3 ll,4mg 

KH 2 P0 4 0,32 g MnCl 2 .4H 2 0... 5,1 mg 

K 2 HP0 4 1,47 g FeS0 4 .7H 2 0..' 4,2 mg 

CaCl 2 .2H 2 0 0,05 g M0O3 1,8 mg 

MgS0 4 .7H 2 0 0,30 g CoCl 2 .6H 2 0 1,6 mg 

EDTA 50 mg Cu0 4 .5H 2 0 1,6 mg 

The pH of the medium is adjusted to 7 with glacial acetic acid 

The TAP-N medium has the same base composition, but this medium differs from 
the first by the absence of nitrogen supplied in the form of ammonium chloride, which 
is replaced with sodium chloride at the same concentration; it is in these culture 
conditions that the cells accumulate a quantity of starch representing up to twenty times 
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that of cells cultivated in TAP medium. In this case, culture is conducted for 5 days in 
continuous light starting from a culture inoculated at 5 x 10 5 cells/ml. 

The cells are then concentrated by centrifugation at 2-4x1 0 8 cells/ml (Tris/acetate 
buffer pH 7.5 50 mM; EDTA 10 mM; DTT 2.5 mM) then subjected to the action of a 
French press at 10000 psi. The extract obtained at press discharge is centrifuged at 5000 
g for 15 min at 4°C. The deposit containing the starch is resuspended in one volume of 
water, to which are added nine volumes of Percoll (Pharmacia, Uppsala, Sweden) 
before being centrifuged at 10000 g for 30 min at 4°C. The Percoll forms a density 
gradient during centrifugation. The starch, which has a high density (1.3 to 1.5), settles 
to the bottom of the tube whereas the lipids and other cell debris of low density form a 
"cap" at the surface of the Percoll gradient. The starch deposit is then rinsed three times 
with deionized water then stored at 4°C after removing from it the last supernatant from 
rinsing. 

- Conditions for immunization of the rabbit, taking and preparation of the 
antiserum : the rabbit used in this experiment is an albino New Zealand hybrid rabbit. 
Three successive injections were made at intervals of three weeks with 20 mg of 
purified starch suspended in 500 jal of water. 500 jjtl of standard Freud adjuvant was 
added to this suspension. Blood samples were taken from the rabbit 3 weeks after the 
last injection. The serum is prepared by the single centrifugation of the blood after 24 
hours of coagulation at 4°C. The antisera generated by the injections of the starches of 
the 137C and IJ2 strains are identified in the following by the designations "antiserum 
SA137C" and "antiserum SAIJ2" respectively. 

b) Preparation and screening of the cDNA library 
The cDNA library was produced from mRNA's purified from the wild strain of C. 
reinhardtii. The X ZAP expression vector was used. 

Detailed protocols: 

- Preparation of the complete RNA's of C reinhardtii : this method is an 
adaptation of the method used for extracting RNA's from the leaves of Arabidopsis 
thaliana. The cells of a culture of 1-2 x 10 6 cells/ml are harvested by centrifugation at 
3500 g for 15 min at 4°C. The cells are then divided into aliquots with a volume of 
about 200 jlxI. At this stage, the cells are frozen in liquid nitrogen and can be stored at 
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-80°C for several months, 400 jul of n Z6" buffer of the following composition is added 
to the 200 (il of frozen cells: 

Buffer Z6: MES/NaOH pH 7.0 20 mM 

EDTA 20 mM 

5 Guanidine-HCl 6 M 

(3-Mercaptoethanol 100 |uM. 

The mixture is stirred very vigorously for several minutes, then 400 \xl of 
phenol/chloroform/isoamyl alcohol mixture (25v/24v/lv) is added and the mixture is 
stirred vigorously again for several minutes. The whole is centrifuged at 13000 g for 10 
10 min at 4°C. After recovering and then estimating the volume of the supernatant, 1/20 

volume of acetic acid at 1 M as well as 0.7 volume of 100% ethanol are added. The 
nucleic acids are given time to precipitate at -20°C for at least 30 min. After 
centrifugation at 13000 g for 15 min at 4°C, the pellet is resuspended in 400 \xl of 3 M 
sodium acetate pH 5.6 then centrifuged for 10 min at 13000 g at 4°C. The pellet is then 
15 rinsed twice with 70% ethanol, dried and finally dissolved in 50 jul of water treated with 

DEPC. The quantity of nucleic acids is determined in a spectrophotometer at 260 nm 
(OD 2 60 == 1 is equivalent to about 40 \xg/ml of nucleic acids). 

- Construction of a cDNA library in the X ZAP vector : the RNA's having a 
polyA tail (the mRNA's in particular) are isolated from the total RNA preparation using 
20 the kit "polyATtract mRNA isolation systems" marketed by Promega (Madison, WI, 

USA). Synthesis of the cDNA's, ligation in the X ZAP vector and packaging in the 
capsids are effected using the kit "cDNA synthesis kit, ZAP-cDNA synthesis kit and 
ZAP-cDNA gigapack II gold cloning kit" marketed by Stratagene (La Jolla, CA, USA). 
The procedure followed corresponds to the instruction manual supplied with the kit. 

25 - Immunological screening of a cDNA library in an expression vector : screening 

of the cDNA expression library of X ZAP from C. reinhardtii was carried out using the 
antiserum previously obtained (see above). About 100000 lysis plates are spread by the 
Top-agar technique on several Petri dishes containing bacterial growth medium and the 
adapted antibiotic. After incubation for 3 hours at 37°C, nitrocellulose filters (Protan 

30 BA 85, Schleicher & Schuell, Dassel, Germany), previously immersed in a solution of 

IPTG 10 mM and dried, are applied to the surface of the Top-agar. The dishes are 
incubated again for 3 hours at 37°C before being stored at 4°C for 30 min. The 
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nitrocellulose filters are then carefully removed from the agar surface. The E. coli strain 
XL 1 -blue was used during screening of the XZAP library. The protocol for filter 
development is then the same as that used in the Western Blot study (see the section 
dealing with Western Blot) . 
5 The positive lysis regions are subjected to two successive series of screening with 

the same antiserum in order to confirm their positive character, and also to purify them. 
When a lysis region is found to be positive at the end of three screenings, the sequence 
of the plasmid pBluescript SK+ containing the insert of interest is excised from the X 
phage in vivo.- It is the "ExAssist helper phage" that is used for cotransfection of the 

10 SOLR strain with the X ZAP phage. In this way we obtain a phagemid that is used for 

infecting the strain XL 1 -Blue MRF ! leading to restoration of the double-stranded 
plasmid pBluescript SK+ bearing the cDNA of interest. 

Screening of this kind, conducted with the SA137C antiserum, led to the 
production of a single positive clone after three screenings. We designated this clone 

15 "CD 142". The insert of the CD 142 clone has a size of 1696 bp (see the sequence in Fig. 

1). 

c) Sequence analysis of the insert of the CD 142 clone 
When the protein sequence libraries are interrogated with the sequence derived 

20 from the cDNA clone "CD142", the greatest similarities are obtained with the GBSSI of 

the higher plants. This first indication of the origin of this cDNA is reinforced by the 
presence of an extension of 119 amino acids (about 14 kDa) in the carboxy terminal 
position of the coding sequence, relative to the main GBSSI's of the higher plants. In 
fact, the molecular weight of the GBSSI of C. reinhardtii, determined by SDS-PAGE, is 

25 on average 10 to 15 kDa higher than that of the corresponding proteins in plants. The 

119 amino acid extension might explain this difference in molecular weight between 
GB SSI's of different origins. Taken separately, this extension of the coding sequence 
does not share any similarity with other known types of polypeptide sequences. 

The presence of the UAA stop codon in position 717 signals the start of a very 

30 long non-coding region of 946 bp. These noncoding regions in 3 ! terminal position, 

which frequently occur in the nuclear genes of Chlamydomonas, seem in particular to 
be intended to stabilize the messenger. 
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B) Cloning ofgDNA 

The gDNA relating to the CD 142 clone was isolated after screening an indexed 
gDNA library in cosmids (Zhang et al. ? 1994). Constructed in a cosmid vector derived 
from c2RB, this gDNA library is contained in 120 96-well microtitration* plates. Each 
well (apart from two, to facilitate orientation of the plate) contains a bacterial clone 
transporting a single cosmid. The whole library thus represents 11280 clones for which 
the average size of the inserts is approx. 38 kb. The nuclear genome of C. reinhardtii is 
therefore represented there statistically about four times. 

Screening of this library with a probe corresponding to the CD 142 clone led to the 
isolation of a genomic DNA clone designated 18B1. The insert present in this single 
cosmid was analysed in more detail. After restriction by NotI then hybridization with 
the CD 142 probe, only a band of about 9 kb remains positive, indicating that all the 
information corresponding to the CD 142 clone is present in this fragment. The genomic 
sequence corresponding to the CD 142 clone is presented below. 

Detailed protocols: 

- Preparation of nylon filters for screening : the nylon filters (Hybond N, 
Amersham Buchler, Braunschweig, Germany) are carefully placed on a Petri dish 
containing a rich bacterial growth medium supplemented with the appropriate antibiotic 
(in the present case, ampicillin is used). Each E. coli clone contained in the library is 
then replicated directly on the nylon filter using a replicating apparatus and the dishes 
thus prepared are incubated overnight at 37°C. The filters are then removed from the 
agar surface and subjected to the following treatment: 

(1) 2 min with a denaturing solution (NaOH 0.5 M; NaCl 1 .5 M) 

(2) 2 min with a neutralizing solution (Tris/HCl pH 7.0 0.5 M; NaCl 1.5 M) 

(3) 2 min with a rinsing solution (buffer 2 x SSC) 

Finally the filters are incubated in a drying cabinet for 2 hours at 80°C. 

- Prehybridization and hybridization of the filters : prehybridization is carried out 
in the hybridization buffer at 42°C for at least 4 hours. Hybridization is effected at 42°C 
for a whole night in the presence of the 32 P-labelled nucleotide probe. The membrane is 
washed at 60°C in the washing solution, adjusted to the stringency that we wish to 
apply. The time and frequency of replacement of the washing baths vary depending on 
the stringency and the radioactivity levels detected on the membrane. In general, the 
baths are renewed every 10 min and washing begins with a washing buffer of low 
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stringency and ends with a buffer of greater stringency. A Kodak X-OMAT AR film is 
finally exposed to the filters at -80°C in order to detect the positive clones. 

Composition of the solutions and buffers: 

Buffer SSC x 20: Dissolve 175.3 g of NaCl and 88.2 g of sodium citrate in 800 ml 
5 of water. Adjust the pH to 7.0 with a few drops of a 10 N solution of NaOH. Make up to 

1 litre with water. 

Hybridization buffer: 

Formamide 50% 

Denhardt's x 5 

10 SDS 0.5% 

Na phosphate buffer pH 7.0 50 mM 

DNA of salmon sperm 1 00 |j,g/ml 

Bovine serum albumin 0.5% 

Denhardt's reagent x 100 (quantity for 500 ml in water): 
15 Ficoll400 10 g 

Polyvinylpyrrolidone 40 (P VP40) 1 0 g 

BSA 10 g 

Phosphate buffer pH 7.0 at 1 M (quantity for 100 ml of buffer): 
Na 2 HP0 4 1 M 57.7 ml 

20 NaH 2 P0 4 1 M 42.3 ml 

Washing buffers: 

low stringency: SSC x 2; SDS 0.2% 

medium stringency: SSC x 1; SDS 0.5% 

high stringency: SSC x 0.5; SDS 0.5% 

25 SSC x 0.1; SDS 0.5% 

- Preparation and labelling of a nucleotide probe with 32 P : the fragment serving 
as nucleotide probe is generally inserted in the multiple cloning site of a bacterial 
plasmid. It is therefore first necessary to digest it with the appropriate restriction 
30 endonucleases then separate the fragment of interest from the rest of the plasmid by 

electrophoresis on 1% agarose gel buffered with TAE buffer x 1. The band 
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corresponding to the fragment of interest is then cut out of the gel and DNA extraction 
is effected with the kit "The GENECLEAN II Kit" marketed by BIO 101 Inc. (La Jolla, 
CA, USA). The piece of agarose is firstly dissolved in a 6 M sodium iodide solution. On 
completion of solution, the DNA molecules are then captured with a silica matrix 
designated "Glassmilk". The DNA molecules, in the presence of the Nal chaotropic 
agent, are adsorbed specifically on the silica beads. After eliminating the salts and the 
dissolved agarose, the DNA molecules are eluted from the silica beads in the presence 
of sterile water. 

Labelling of a nucleotide probe with 32 P is accomplished using the "Random 
primed DNA labelling kit" from Boehringer (Mannheim, Germany). The principle is 
random priming of the elongation reaction by Klenow DNA polymerase using a mixture 
of hexanucleotides representing all the possible combinations of sequences. The 
radioactive element is incorporated starting from [oc- 32 P]-dCTP (3000 Ci/mmol) of 
which 50 |iiCi is used for each labelling reaction. The radiolabeled probe is finally 
added to the hybridization solution after denaturing at 95°C for 4 min. 

C) The STA2 locus in C reinhardtii represents the structural gene of the GBSSI 
The following analysis demonstrates formally that the STA2 gene of C reinhardtii 
corresponds to the structural gene of the GBSSI and that the CD142 clone represents a 
cDNA that comes from this locus. In fact, restriction analyses of the genomic DNA 
digested by the BamHI endonuclease reveal a profound change of the restriction profile 
in the mutant BAFR1 strain at the STA2 locus generated by gene interruption (Delrue et 
al., 1992). The same change is also observed in the double mutant IJ2 strain at the STA2 
and ST A3 loci which Maddelein et al. (1994) generated by crossing the BAFR1 strain 
with a sta3-l mutant strain. 

Moreover, this change of the restriction profile in the meiotic progeny of the LT2 
strain fused with the "CS9" strain of C. smithii could be followed in the following 
crossing: 
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CS9 



X 



IJ2 



(C smithii) 



(C. reinhardtii) 



+/+ 



sta2-29::ARG7/sta3-l 



+/+ 



25% 



sta2-29::ARG7/sta3-l 



25% progeny 



sta2-29::ARG7/+ 



25% meiotic 



+/sta3~l 



25% 



352 segregants resulting from this crossing were purified, amplified and their 
starch accumulation phenotype was analysed. 54 meiotic recombinants underwent 
restriction analysis: 21 of genotype sta2-29::ARG7/+ y 19 of genotype +/sta3-l and 14 
wild. With regard to the 21 segregants of genotype sta2~29::ARG7/+ their restriction 
profile, obtained by digestion with BamHI and hybridized with the CD 142 probe, still 
has the same change as the parental strain IJ2. We deduce from this that the STA2 gene 
and the CD 142 probe are very strongly linked genetically. There is no longer any doubt 
as to the nature of the CD142 clone, which represents the structural gene of the GBSSI 
(the STA2 locus). 

Detailed protocols: 

Carrying out the crossings : before carrying out the fusion of cells with opposite 
sexual polarities, it is necessary to put them in a state that is favourable to their fusion. 
Thus, the cells must first be differentiated into gametes before they are put in direct 
contact. Gametogenesis is induced in Chlamydomonas by subjecting the cells to a 
nitrogen deficit and in the presence of a strong light source (5000 lux). For this, fresh 
cells cultivated on rich agar medium (culture of less than 5 days) are suspended in 2 ml 
of TAP-N medium and left for at least 12 hours in strong light without agitation. The 
state of the cells is examined with an optical microscope before being brought into 
contact. After differentiation into gametes, the cells, are smaller and in particular much 
more active than in the case of a non-deficient culture. Equivalent quantities of cells of 
each sexual polarity are mixed. Fusion is always carried out in strong light. After one 
hour of contact, cell fusions are already visible in the optical microscope. Analysis of 
the meiotic segregants will consist of depositing the products of cellular fusion on a rich 
medium at 4% agar. The dishes thus obtained are incubated in diffuse light for 15 hours 
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then stored in total darkness for at least a week. This permits maturation of the zygotes 
and their "encystment" in the 4% agar. After this period of incubation in darkness, the 
dishes are returned to the light and the following stages are carried out as quickly as 
possible. In order to eliminate the greatest possible number of unfused vegetative cells, 
the surface of the agar is scraped very lightly with a razor blade. Observing with a 
binocular magnifier, a region containing about fifty zygotes is marked off and these are 
transferred to a fresh dish of rich medium at 1.5% agar. To be sure of complete 
disappearance of residual vegetative cells, the dish is subjected to chloroform vapours 
for 45 seconds to 1 minute (in contrast to other cells, zygotes can withstand this 
moderate time of exposure to chloroform vapours). The presence of light will 
irreversibly trigger the start of meiosis of the zygotes. During their germination 
(facilitated by the higher moisture content of the medium containing 1.5% agar) the 
zygotes will release four haploid daughter cells (a tetrad), which will grow by mitotic 
divisions and form colonies in the dish. Analysis of the meiosis products can be effected 
in two ways. The first consists of random investigation of at least 200 segregants 
resulting from the crossing. After purification of the segregants, the characters of the 
latter can be studied by replication on different selective media. 

Techniques of extraction of genomic DNA : the protocol adopted for extraction of 
total DNA is that described by Rochaix et al. (1991); here are the details: 

(1) Centrifuge 10 ml of cell culture to about 3-5 x 10 6 cells/ml for 10 min at 3500 
g in a 1 5 ml bottle, 

(2) The pellet of cells is then resuspended in 350 pi of the following buffer: 
Tris/HClpH8.0 20 mM 

EDTA 50 mM 

NaCl 100 mM. 

(3) Add 50 \xl of proteinase K from a stock solution at 2 mg/ml (if unavailable, it 
is possible to use pronase at 10 mg/ml). 

(4) Add 25 jlxI of SDS at 20% and incubate for 2 hours at 55°C. 

(5) Add 2 jllI of diethylpyrocarbonate (DEPC) and incubate for 15 min at 70°C. 

(6) Cool the tube briefly in ice and add 50 \i\ of 5 M solution of potassium acetate. 

(7) Mix, by shaking the tube correctly, and leave on ice for at least 30 min (it is 
possible to stop the extraction at that moment and resume on the next day if the tubes 
are left in ice in a coldroom). 



32 



(8) Transfer to a 1.5ml Eppendorf tube and centrifuge for 15 min in a 
minicentrifuge (at about 13000 g). 

(9) Recover the supernatant, transferring it to a new Eppendorf tube. 

(10) Extract the supernatant with one volume of the following mixture: 

Phenol (saturated with TE : Tris/HCl pH 8.0 10 mM, EDTA 1 mM) 25 vol 
Chloroform 24 vol 

Isoamyl alcohol 1 vol 

(11) After extraction, add 1 ml of 100% ethanol at room temperature. A 
precipitate should be seen to appear in the form of "angel hair" if extraction is 
successful. From this moment, manipulations must be careful and gentle so that the 
DNA molecules do not break. 

(12) Centrifuge for 5 min in a minicentrifuge (about 13000 g). 

(13) Rinse the pellet with 70% ethanol and centrifuge for 3 min in a 
minicentrifuge. 

(14) Repeat operation (13) once or twice for proper elimination of the salts. 

(15) Dry the pellet for 5 min at 37°C 5 then dissolve it in 50 \il of TE containing 
bovine pancreas RNase at 1 jj,g/ml. 

Molecular hybridizations and Southern Blot analyses : 25 [ig of DNA is digested 
completely with the appropriate restriction endonuclease(s). The restriction products are 
then separated by electrophoresis in 0.8% agarose gel, TBE x 1. Then the gel is 
incubated successively for 15 min in the depurination solution and for 30 min in the 
denaturing solution. The denatured DNA is then transferred onto "Porablot NYplus" 
nylon membrane (Macherey-Nagel GmbH, Diiren, Germany) by capillarity with SSC 
buffer x 20. After transfer, the membrane is incubated at 80°C in absence of air for 2 
hours to fix the DNA fragments to the surface of the nylon membrane. Prehybridization 
is effected in the hybridization buffer at 42°C for at least 4 hours. Hybridization is 
accomplished at 42°C for a whole night in the presence of the labelled probe prepared 
previously. The membrane is washed at 60°C in the washing buffer adjusted to the 
stringency that is to be applied to the washing. The time and frequency of replacement 
of the washing baths vary depending on the stringency and the levels of radioactivity 
present on the membrane. In general, the baths are renewed every 10 min and washing 
begins with a washing buffer of low stringency and ends with a buffer of higher 
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stringency. A Kodak X-OMAT AR film is finally exposed to the membrane at -80°C to 
reveal the hybridization zones. 

II) Investigation of binding of the GBSSI to the starch grannie. 
A) Analysis of the sta2-J mutant allele 

Among all the mutant alleles generated at the STA2 locus in C. reinhardtii, just 
one leads to the production of a 58 kDa truncated GBSSI in place of the 76 kDa wild 
protein. This is the sta2-l allele of the 18B strain. Deluxe et al. (1992), by micro- 
sequencing of the GBSSI extracted from a polyacrylamide gel, were able to demonstrate 
that the amino terminal peptide sequences of the proteins of the wild strain (137C) and 
of the mutant strain (18B) are identical. 

Amino terminal sequences: 

® GBSSI of the 137C strain: ALDIVMVAAEVAPGGKTGGLGDV 
® GBSSI of the 18B strain: ALDIVMVAAEVAPGGKTGGLGDV 
The protein produced by the sta2-l mutant allele is therefore truncated in the 

carboxy terminal position and the K m for ADP-glucose is increased by a factor of 6. 

Absence of this carboxy terminal sequence does not, however, alter the properties of 

fixation of the protein on the granule, as is shown in Fig. 1 . 

Detailed protocol: 

Technique of extraction of the proteins from the starch granule and SDS-PAGE : 
the proteins are extracted from 0.3 to 1 mg of starch with 60 of extraction buffer: (3- 
mercaptoethanol 5% (v/v); SDS 2% (w/v) at 100°C for 5 min. After centrifugation at 
13000 g for 10 min, the supernatant is recovered and the operation is repeated once with 
the pellet. The two supernatants are combined and the sample can be loaded into the gel 
wells after adding again 10 \il of the following loading buffer: Tris 50 mM, glycine 384 
mM, 20% glycerol, SDS 0.1%, bromophenol blue 0.001%. Migration is carried out at 
room temperature, at 150 V for 1 h 30 (until the bromophenol blue leaves the gel). The 
proteins are revealed by staining with Coomassie 'blue or by immunodetection (see 
below; section relating to Western Blot). During staining with Coomassie blue, the gel 
is incubated for 30 min in the following solution: 2 g of Coomassie Brilliant Blue R250, 
0.5 g of Coomassie Brilliant Blue G250, 425 ml of ethanol, 50 ml of methanol, 100 ml 
of acetic acid; water sufficient for 10,00 ml. The gel is then decolorized using the 
following solutions: 
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> 15 to 30 min in decolorizer I: 450 ml of ethanol, 50 ml of acetic acid; make up 
to 1000 ml with water. 

> one night in decolorizer II: 80 ml of acetic acid, 50 ml of methanol, make up to 
1000 ml with water; this decolorizer II removes the nonspecific coloration of the gel. 

> decolorizer III (240 ml of acetic acid, 200 ml of methanol, make up to 1000 ml 
with water) permits complete decolorizing of the gel if necessary. 

B) Determination of the quantity of proteins bound to the granule 

The quantity of proteins bound to the starch granule was determined in different 
culture conditions and in various gene libraries. For this, the cells were placed in 
conditions of massive accumulation of starch (nitrogen-deficient medium) or in 
conditions of mixotrophic growth (presence of nitrogen). The proteins extracted from 
the granule were then deposited on polyacrylamide gel in denaturing conditions 
(presence of SDS). After migration, the proteins are revealed by staining with 
Coomassie blue. The 17 strain, mutant at the STA1 locus, was used during this 
experiment. This mutation was described in detail by Van den Koornhuyse et ah (1996) 
and then by Van de Wal et al. (1998). The STA1 locus corresponds to the structural gene 
of the large regulatory subunit of ADP-glucose pyrophosphorylase. The stal-1 mutation 
produced during X-ray mutagenesis leads to insensitivity of the enzyme to 3- 
phosphoglyceric acid, its alios teric activator. Consequently, the 17 strain accumulates 
less than 5% of the normal quantity of starch. The estimate of the quantity of GBSSI 
bound to the granule is approx. 0.1% of the weight of starch in conditions of nitrogen 
deficiency for the 137C and 18B strains. This value reaches 1% in conditions of 
mixotrophy. In the case of the 17 strain, regardless of the culture conditions, the GBSSI 
represents more than 1% of the weight of the starch granule. ^The techniques employed 
in this analysis are the same as those described in the preceding paragraph. 

C) Analysis of immune response in Western Blot 

To test the antigenicity of the SA137C and SAIJ2 antisera obtained previously in 
the rabbit, the proteins extracted from 100 |itg of fresh starch obtained from different 
strains cultivated in variable culture conditions were subjected to analysis by the 
immunotransfer technique (Western Blotting). The immune response produced with 
respect to GBSSI during injection of the starch from the wild strain (137C) proves very 
specific and strong (the proteins having been extracted from just 100 \ig of fresh starch) 
even in the case of the truncated protein in the sta2-l mutant. The quantity of proteins 
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bound to the starch granule seems larger in the 17 mutant during nitrogen-deficient 
culture, as shown by the presence of a mass band higher than GBSSI revealed by the 
SA137C antiserum. This is confirmed by Western Blot analysis effected with the SAIJ2 
antiserum, where the strongest immune response is detected with the proteins extracted 
from the starch of the 17 strain cultivated with nitrogen deficiency. 

For control purposes, we carried out the same type of experiment using the PAS 5 
antiserum obtained by Abel et al (1995). This antiserum produced in the rabbit is 
directed against a peptide whose consensus sequence corresponds to the strongly 
conserved carboxy terminal region in all the starch synthases of higher plants, whether 
they are soluble or bound to the starch granule. This antiserum recognizes the GBSSI of 
C. reinhardtii specifically when the latter is present in the granule. Moreover, the PA55 
antiserum also recognizes the truncated protein produced by the 18B mutant (sta2-l). It 
therefore appears that the highly conserved sequence in carboxy terminal position is still 
present in the truncated protein. 

Detailed protocols: 

Technique of protein extraction from the starch granule and SDS-PAGE : these 
techniques are the same as those described in the preceding section apart from staining 
with Coomassie blue, which is omitted in this case. 

Technique of transfer and detection with antisera : when migration on SDS-PAGE 
has ended, the gel is incubated for 30 min in the "Western" buffer x 1 containing 20% 
of methanol. The proteins are then electrotransferred onto a nitrocellulose membrane 
(Protan BA 85 Schleicher & Schuell, Dassel, Germany) using electrotransfer apparatus 
(Multiphor II, LKB-Pharmacia, Bromma, Sweden) at 4°C in^ the following conditions: 
45 min at 250 mA with the buffer used previously. After this stage of transfer of the 
proteins onto the nitrocellulose membrane, the latter is incubated for 1 hour at room 
temperature in TBST buffer containing 3% of BSA. The membrane is then rinsed three 
times in TBST buffer before being incubated overnight at 4°C in the rabbit primary 
antiserum diluted in TBS buffer. The membrane is again rinsed three times with TBST 
buffer and is then incubated for 1 hour at room temperature with the biotinylated 
secondary antibody diluted at 1/500 in the TBS directed against the rabbit antiserum. 
After three further rinses in TBST buffer, the membrane is incubated for 30 min at room 
temperature with the streptavidin-alkaline phosphatase complex at 1/3000 dilution in 
the TBS buffer. Finally, after 3 rinses in TBST buffer, the membranes are developed by 
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incubation in a diethanolamine buffer containing the substrates of alkaline phosphatase: 
NBT and BCIP (the incubation time varies depending on the intensity of reaction). The 
detection kit used is the one offered by Amersham Buchler (Braunschweig, Germany): 
"Blotting detection kit for rabbit antibodies" 

Compositions of solutions and buffers: 

"Western" buffer x 10: Glycine 390 mM 

Tris 480 mM 

SDS 0.375% 
TBS buffer (Tris Buffer Saline) Tris/HCl pH 7.5 20 mM 

NaCl 500 mM 

TBST buffer (Tris Buffer Saline Tween): TBS + 0.1% (v/v) Tween 20 
NBT: Nitro-Blue Tetrazolium in solution in dimethylformamide 70% 
BCIP: 5-Bromo-4~Chloro-3-Indolyl Phosphate in solution in 
dimethylformamide. 

Ill) Targeting of fusion proteins in the starch granule 

The specific carboxy terminal extension of the GBSSI of C. reinhardtii is not 
required for targeting the protein to the starch granule in vivo as we were able to 
demonstrate in the previous experiments. The extension of about 16 kDa can be 
replaced by a peptide sequence of interest, thus permitting its targeting to the very heart 
of the starch granule. 

The various types of vectors that can be constructed for applying this method in 
higher plants consist of: 

- a bacterial selector gene and a bacterial replication origin in order to be able to 
amplify the plasmid in a suitable bacterial strain 

- a selector gene that will permit easy selection of transformant plants 

- translational fusion between the coding sequence of the GBSSI and a 
polypeptide sequence of interest. Two main types of translational fusions may be 
considered: in the first case, it is the 58 kDa truncated sequence from GBSSI that is 
fused with the sequence of interest; in the second case, the complete sequence of the 
GBSSI is employed. 
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- fusion can be put under the control of a strong constitutive plant promoter, or 
of an inducible plant promoter, immediately followed by a suitable transit peptide 
promoting translocation of the fusion protein to the chloroplast. 

IV) Protocol for determination of the activity of granule-bound starch synthase: 
Add 20 |ng of starch to 100 jlxI of the following reaction mixture: 

Glycylglycine/NaOH pH 9.0 50 mM 

(NH 4 ) 2 S0 4 100 mM 

P-mercaptoethanol 5 mM 

MgCl 2 5 mM 

Bovine serum albumin 0.5 mg/ml 

ADP-glucose 0.2 mM 

[U 4 C] ADP-glucose (235 mCi/mmol) 2.66 jag 

Trisodium citrate 0 or 0.5 M (specified depending on 

circumstances) 

The reaction is carried out at 30°C for 15 min and is then stopped by adding 3 ml 
of 70% ethanol. The precipitate obtained is filtered under vacuum on a "Whatman Glass 
Fibre" filter (Whatman, Maidstone, UK), and rinsed with 4 x 5 ml of 70% ethanol. A 
Beckman counter is used for radioactivity counting after the filters have been placed in 
counting phials containing 3.5 ml of scintillation liquid. 

V) The methods of starch extraction and purification are as follows: 

- in the case of a single-celled green algae such as Chlamydomonas reinhardtii 
(see the method described above) 

- in the case of seeds, tubers or any other organ of higher plants: 

the organ or the type taken from the plant is properly homogenized (after grinding). The 
pulverized material thus obtained is rinsed with water through a filter cloth (such as 
Miracloth Calbiochem, La Jolla, CA, USA). The filtrate is then left to stand for two 

hours for the starch granules to settle. The sediment is rinsed firstly with several 
volumes of water then a second time with several volumes of NaCl solution at 0.1 M. 
The sediment is filtered once again then rinsed twice with ethanol before being dried. 
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CLAIMS 

1. A recombinant nucleotide sequence, characterized in that it contains, in the 
5 f -»3 ! direction, a nucleotide sequence coding for an adenosine diphosphate glucose a- 
1,4-glucan oe-4-glucosyltransferase or starch synthase EC 2.4.1.21, or for a protein 
derived from this enzyme, especially by suppression, addition or substitution of one or 
more amino acids, the said enzyme or derived protein having the property of migrating 
to the sites of biosynthesis of the starch granules in plant cells and of attaching to the 
starch granules"; the said nucleotide sequence coding for the enzyme or aforementioned 
protein being positioned upstream of a nucleotide sequence coding for a peptide or 
polypeptide of interest. 

2. A recombinant nucleotide sequence according to Claim 1, characterized in 
that the nucleotide sequence coding for a starch synthase, or for a derived protein, codes 
for the starch synthase bound to the starch granule or GBSS present in particular in 
plants, algae or micro-algae, and more especially for the isoform GBSSI, or for a 
protein derived from GBSS as defined in Claim 1. 

3. A recombinant nucleotide sequence according to Claim 1 or 2, characterized 
in that the nucleotide sequence coding for a starch synthase, or for a derived protein, is 
selected from: 

- the nucleotide sequence SEQ ID NO : 1 of the cDNA coding for the GBSSI of 
Chlamydomonas reinhardtii, 

- or a fragment of the nucleotide sequence SEQ ID NO : 1 shown, such as the 
sequences in which the nucleotide of the 5' end corresponds to that located in one of the 
positions 1 to 186 of SEQ ID NO : 1, and in which the nucleotide of the 3' end 
corresponds to that located in one of the positions 1499 to 3117 of SEQ ID NO : 1, 
especially: 

. the sequence SEQ ID NO : 2 coding for the GBSSI of Chlamydomonas 
reinhardtii in the form of pre-protein of 708 amino acids (SEQ ID NO : 3), 

. the sequence SEQ ID NO : 4 coding for the GBSSI of Chlamydomonas 
reinhardtii in the form of mature protein of 651 amino acids (SEQ ID NO : 5), 

. the sequence SEQ ID NO : 6 coding for a fragment of 438 amino acids 
(SEQ ID NO : 7) of the GBSSI of Chlamydomonas reinhardtii, 



42 



. the sequence SEQ ID NO : 8 coding for a fragment of 531 amino acids 
(SEQ ID NO : 9) of the GBSSI of Chlamydomonas reinhardtii, 

- -or a nucleotide sequence derived by degeneration of the genetic code of the 
aforementioned nucleotide sequences, and coding for the aforementioned GBSSI of 
Chlamydomonas reinhardtii, or for an aforementioned peptide fragment of the latter, 

- or a nucleotide sequence derived from a nucleotide sequence or fragment 
mentioned above, especially by substitution, suppression or addition of one or more 
nucleotides, and coding for a peptide sequence derived from the aforementioned GBSSI 
of Chlamydomonas reinhardtii, or derived from an aforementioned peptide fragment of 
the latter, and having the property of attaching to the starch granules, the said derived 
nucleotide sequence preferably having a homology of at least about 50%, and preferably 
of at least about 70%, with the aforementioned nucleotide sequence or fragment, 

- or a nucleotide sequence capable of hybridizing with one of the 
aforementioned nucleotide sequences or fragments. 

4. A recombinant nucleotide sequence according to one of the Claims 1 to 3, 
characterized in that the nucleotide sequence coding for a peptide or polypeptide of 
interest is selected from: 

- those encoding biologically active peptides, especially peptides of therapeutic 
interest or that can be used in the agricultural and food industry, or 

- those encoding enzymes that are able to transform starch, such as enzymes that 
interact with oc-glucans including various hydrolases, phosphorylases, a- 1,4 
glucanotransferases, branching enzymes, amylases, and especially heat-resistant 
hydrolases obtained from extremophiles such as archaebacteria that are active at 
temperatures above 40°C. 

5. A recombinant nucleotide sequence according to one of the Claims 1 to 4, 
characterized in that it contains a nucleotide sequence encoding a cleavage site, the said 
nucleotide sequence being positioned between the nucleotide sequence coding for a 
starch synthase, or a protein derived from the latter, and the nucleotide sequence 
encoding the polypeptide of interest, 

6. Transgenic plant cells, selected from the cells of plants, algae or micro- 
algae, that are able to produce starch, the said cells containing a recombinant nucleotide 
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sequence according to one of the Claims 1 to 5 integrated in its genome or maintained 
in a stable manner in its cytoplasm. 

7. Transgenic plants, algae or micro-algae, or parts, especially flowers, fruits, 
5 leaves, stems, roots, seeds, or fragments of these plants, algae or micro-algae, 

containing a recombinant nucleotide sequence according to one of the Claims 1 to 5 
integrated in the genome or maintained in a stable manner in the cytoplasm of the cells 
of which they are composed. 

10 8. A fusion polypeptide, characterized in that it contains: 

- in the N-terminal position, a starch synthase, or a protein derived from this 
enzyme, especially by suppression, addition or substitution of one or more amino acids, 
the said starch synthase or derived protein having the property of migrating to the sites 
of biosynthesis of the starch granules in plant cells and of attaching to the starch 

15 granules, 

- and, in the C-terminal position, a peptide or polypeptide of interest, 

the C-terminal part of the amino acid sequence of the starch synthase, or of the 
derived protein, thus being bound to the N-terminal part of the peptide sequence of 
interest, the said fusion polypeptide being encoded by a recombinant nucleotide 
20 sequence according to one of the Claims 1 to 5. 

9. A fusion polypeptide according to Claim 8, characterized in that the starch 
synthase is selected from: 

- the peptide sequence SEQ ID NO : 3 corresponding to the GBSSI of 
25 Chlamydomonas reinhardtii in the form of pre-protein of 708 amino acids, 

- or a fragment of the peptide sequence SEQ ID NO : 3, such as the sequences in 
which the amino acid of the amino terminal end corresponds to that located in one of 
the positions 1 to 58 of SEQ ID NO : 3, and in which the amino acid of the carboxy 
terminal end corresponds to that located in one of the positions 495 to 708 of SEQ ID 

30 NO : 3, especially: 

. the sequence SEQ ID NO : 5 corresponding to the GBSSI of 
Chlamydomonas reinhardtii in the form of mature protein of 651 amino acids, 

. the sequence SEQ ID NO : 7 corresponding to a fragment of 438 amino 
acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii, 
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. the sequence SEQ ID NO : 9 corresponding to a fragment of 531 amino 
acids of the peptide sequence of the GBSSI of Chlamydomonas reinhardtii, 
~ or a peptide sequence derived from an aforementioned peptide sequence or 
fragment, especially by substitution, suppression or addition of one or more amino 
acids, and having the property of attaching to the starch granules, the said derived 
peptide sequence preferably having a homology of at least about 60%, and 
advantageously at least about 80%, with the aforementioned peptide sequence or 
fragment. 

10. A fusion polypeptide according to Claim 8 or 9, characterized in that it 
contains a cleavage site positioned between, on the one hand, the starch synthase, or a 
protein derived from the latter, and, on the other hand, the polypeptide of interest. 

11. Starch granules, characterized in that they contain one or more fusion 
polypeptides defined in one of the Claims 8 to 10. 

12. A pharmaceutical composition, characterized in that it contains starch 
granules according to Claim 11, if necessary in combination with a physiologically 
acceptable vehicle, the said granules containing one or more fusion polypeptides as 
defined in one of the Claims 8 to 10, the peptide of interest in the said fusion 
polypeptides possessing a defined therapeutic effect. 

13. A pharmaceutical composition according to Claim 12, characterized in that 
it is in a form that can be administered parenterally, especially intravenously, or in a 
form that can be administered orally, the diameter of the starch granules being between 
about 0.1 |Ltm and several tens of jum, and the proportion by weight of the fusion 
polypeptides in these granules being between about 0.1% and 1%. 

14. A pharmaceutical composition, characterized in that it contains one or more 
fusion polypeptides as defined in one of the Claims 8 to 10, if necessary in combination 
with a physiologically acceptable vehicle, the peptide of interest in the said fusion 
polypeptides possessing a defined therapeutic effect. 
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15. A food composition, characterized in that it contains starch granules 
according to Claim 11, the said granules containing one or more fusion polypeptides as 
defined in one of the Claims 8 to 10, the peptide of interest in the said fusion 
polypeptides being usable in the food-processing field. 

16. A method of preparation of starch granules according to Claim 11, 
characterized in that it comprises the following stages; 

- transformation of plant cells, by means of a cellular host, such as 
Agrobacterium tumefaciens, transformed by a recombinant vector, especially of the 
plasmid, cosmid or phage type, containing a recombinant nucleotide sequence 
according to one of the Claims 1 to 5, 

- obtaining plants, algae or micro-algae transformed in such a way that their 
genome contains one or more nucleotide sequences according to one of the Claims 1 to 
5, by in vitro culture of the aforementioned transformed host cells, 

- if necessary, fertilization and recovery of the seeds of the plants obtained in the 
preceding stage, and cultivation of these seeds to obtain plants of the next generation, 

- extraction of the starch granules from the plants, algae or micro-algae, or from 
parts, especially flowers, fruits, leaves, stems, roots, or fragments of these 
aforementioned transformed plants, algae or micro-algae, especially by sedimentation. 

17. A method of preparation of fusion polypeptides according to one of the 
Claims 8 to 10, characterized in that it includes the implementation of the method 
according to Claim 16, the said method comprising an additional stage of recovery, and 
if necessary of purification, of the fusion polypeptides from the starch granules. 

18. A method of preparation of a peptide of interest, characterized in that it 
includes the implementation of the method according to Claim 16 or Claim 17, the said 
method being carried out by transformation of host cells with the nucleotide sequences 
according to Claim 5, and includes an additional stage of cleavage of the fusion 
polypeptide obtained, by means of a suitable reagent, then, if necessary, a stage of 
purification of the polypeptide of interest. 

19. A method of biotransformation of starch granules, characterized in that it 
comprises the following stages: 
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- transformation of plant cells, by means of a cellular host, such as 
Agrobacterium tumefaciens, transformed by a recombinant vector, especially of the 
plasmid, cosmid or phage type, containing a recombinant nucleotide sequence 
according to Claim 4, encoding enzymes that are able to transform starch, ~ 

- obtaining plants, algae or micro-algae transformed in such a way that their 
genome contains one or more aforementioned nucleotide sequences, by in vitro culture 
of the aforementioned transformed host cells, 

- if necessary, fertilization and recovery of the seeds of the plants obtained in the 
preceding stage, and cultivation of these seeds to obtain plants of the next generation, 

- extraction of the starch granules from the plants, algae or micro-algae, or from 
parts, especially flowers, fruits, leaves, stems, roots, or fragments of these 
aforementioned transformed plants, algae or micro-algae, especially by sedimentation, 

- if necessary, heating of the said starch granules to a temperature at which the 
peptide of interest of the fusion polypeptide is capable of being active. 



ABSTRACT 



STARCH GRANULES CONTAINING A RECOMBINANT POLYPEPTIDE OF 
INTEREST, A METHOD OF OBTAINING THEM, AND THEIR USES 



The present invention relates to starch granules containing a fusion polypeptide 
between a starch synthase and a recombinant polypeptide of interest, the nucleotide 
sequences used for obtaining them, their methods of preparation, as well as their uses, 
especially in pharmaceutical compositions. 

(no drawing) 
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<210> 1 
<211> 3117 
<212> DNA 

<213> Chlamydomonas reinhardtii 
<400> 1 

gccaggcgga gcgtatggct gttgcctcta ccagccgccc cagcagcgcg cgtcctatcg 60 

tcatcaatgc cgcgtcgttc ggtgtcaaga agaccgcgaa ccagctgctg cgtgagcttg 120 

ctcgtggctc cgcacgcaag tccacctcgc gctcggctgt tactggtgcc actggtgcca 180 

cttgcgcgct ggacatcgtg atggttgctg ctgaggtcgc cccttggtcc aagacgggcg 240 

gcctgggcga tgtgactggt ggcctgccta ttgagctggt caagcgcggc caccgcgtca 300 

tgaccattgc ccctcgctac gaccagtacg ctgacgcctg ggacacctcg gtggtcgtgg 360 * 

acatcatggg cgagaaggtc cgctacttcc actccatcaa gaagggcgtg caccgcgtgt 420 

ggattgacca cccctggttc ctggccaagg tctggggcaa gaccggctcc aagctgtacg 480 

gcccccgctc cggcgctgac tacctggaca accacaagcg cttcgccctg ttctgcaagg 54 0 

ccgctattga ggctgcccgc gtgctgccct tcggccccgg cgaggactgc gtcttcgtgg 600 

ccaacgactg gcactccgcc ctggtgcccg tcctgctgaa ggacgagtac cagcccaagg 660 

gccagttcac caaggccaag tcggtgctgg ctatccacaa catcgccttc cagggccgca 720 

tgtgggagga ggctttcaag gacacgaagc tgccccaggc cgcctttgac aagctggcct 780 

tctcggacgg ctatgccaag gtttacactg aggccacccc catggaggag gacgagaagc 840 

ccccgctgac gggaaagacc tacaagaaga tcaactggct gaagggtggc attatcgccg 900 

ccgacaagct ggtgactgtg tcgcccaact acgcgaccga gatcgctgcc gatgccgccg 960 

gcggtgtgga gctggacacc gtcatccgcg ccaagggcat tgagggcatt ,gtgaacggca 1020 

tggacattga ggagtggaac cccaagaccg acaagttcct gtctgcgccc '"tacgaccaga 1080 

acagcgtcta cgccggcaag gccgccgcca aggaggccct gcaggccgag ctgggcctgc 1140 

ctgtggaccc caccgccccc ctgttcgcct tcatcggccg cctggaggag cagaagggtg 1200 

tggacatcat cctggccgcc ctgcccaaga tcctggccac ccccaaggtg cagatcgcca 1260 

tcctgggtac cggcaaggcc gcctacgaga agctggtgaa cgccatcggc accaagtaca 1320 

agggccgcgc caagggcgtg gtcaagttct cggcgcccct ggcgcacatg ctcaccgccg 1380 

gcgccgactt catgctggtg ccctcgcgct tcgagccctg cggcctgatc cagctgcacg 1440 

ccatgcacta cggtaccgtg cccgtggtag cctccaccgg cggcctggtc gacaccgtca 1500 

aggagggcgt caccggcttc cacatgggcg ccctgaaccc cgacaagctg gacgaggctg 1560 

acgccgacgc cctggccgcc accgtgcgcc gtgccagcga ggtgtttgcg ggcggccgct 1620 

accccgagat ggtggccaac tgcatcagcc aggacctgtc ctggtccaag cccgcccaga 1680 

agtgggaggg cctgctggag gaggtggtgt acggcaaggg cggcgtggcc accgccaaga 1740 

aggaggagat caaggtgccc gttgccgaga agatccccgg cgacctgccc gccgtgtcct 1800 

acgcccccaa caccctgaag cccgtgtccg cctccgtgga gggcaacggc gccgccgcgc 18 60 

ccaaggtcgg caccaccgcc cccgccatgg gcgcgtggcg cgcgaccacc ccctcgggcc 1920 

cctcgcccgc cgccgccacc cccaaggtga ccapctacaa gcccgccctg cccgccaccg 1980 

ccaagcccaa gaccgctggc ctcaagctgg ccggtgaggc ctccaccacc tcgacctcgg 2040 

agaacggcgc tgcctccaac ggcaacggca acggtgcctc ggcctccaag acctcggctg 2100 
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ccaagcccct 
gcggcgacag 
gcggccgcca 
gagcatggca 
tgaagtcggt 
gtttcggttt 
tgcgcggcgc 
tcggcggcaa 
gagagtttgc 
cctccggcaa 
ggatacataa 
ggttttgttg 
cttgggctca 
ggctcgaagc 
cttgactgga 
acccagtagt 
tgagagagta 



ggtctccgcc 
catgagcggc 
ccggcgagga 
ggcggaaacg 
tttgcgagac 
tgcccaagtc 
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aaaaaaaaaa 



aagcggcagt 
aacggctgta 
gcgatgagct 
atgagagtgt 
tttgtagtgc 
tgtagcagta 
cctgccctca 
aggatttttg 
cgcgccgggc 
accccccaat 
ggcctgggtc 
ccctcgcaag 
gccagcggtt 
tggagaaaga 
gaggcattgg 
gagtaccgcg 
aaaaaaaaaa 



agccgcagag 
gcagcggcag 
tagcggccgt 
cgtgctgtaa 
agggcctgtg 
gcagagcact 
gccttgtgat 
caggctgcct 
tctcctaccg 
gtagaggatg 
gcccctcgat 
gcccgggtgc 
ttttgaggct 
ggcggacatg 
cggactattg 
cattaatccg 
aaaaaaa 



2160 
2220 
2280 
2340 
2400 
2460 
'2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3117 



<210> 2 
<211> 2124 
<212> DNA 

<213> Artificial sequence 

<220> 

<223> Description of the artificial sequence: fragment 

of the complete sequence of cDNA coding for the GBSSI 
of Chlarnydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1) . . (2124) 



<400> 2 

atg get gtt gec tct acc age cgc ccc age age gcg cgt cct ate gtc 48 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 

1' 5 10 15 

ate aat gee gcg teg ttc ggt gtc aag aag acc gcg aac cag ctg ctg 96 

lie Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 

20 25 30 



cgt gag ctt get cgt ggc tec gca cgc aag tec acc teg cgc teg get 144 
Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Ar^ Ser Ala 
35 40 45 



gtt act ggt gee act ggt gec act tgc gcg ctg gac ate gtg atg gtt 192 
Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp He Val Met Val 
50 55 60 



get get gag gtc gee cct tgg tec aag acg ggc ggc ctg ggc gat gtg 240 
Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 



act ggt ggc ctg cct att gag ctg gtc aag cgc ggc cac cgc gtc atg 288 
Thr Gly Gly Leu Pro He Glu Leu Val Lys Arg Gly His Arg Val Met 
85 90 95 



acc att gec cct cgc tac gac cag tac get gac gec tgg gac acc teg 336 
Thr He Ala Pro Arg Tyr Asp Gin Tyr Ala Asp Ala Trp Asp Thr Ser 
100 105 110 



3 



gtg gtc gtg gac ate atg ggc gag aag gtc cgc tac ttc cac tec ate 
Val Val Val Asp He Met Gly Glu Lys Val Arg Tyr Phe His Ser He 
115 120 125 



38' 



aag aag ggc gtg cac cgc gtg tgg att gac cac ccc tgg ttc ctg gec 432 
Lys Lys Gly Val His Arg Val Trp He Asp His Pro Trp Phe Leu Ala 
130 135 140 

aag gtc tgg ggc aag acc ggc tec aag ctg tac ggc ccc cgc tec ggc 480 
Lys Val Trp Gly Lys Thr Gly Ser Lys Leu Tyr Gly Pro Arg Ser Gly 
145 150 155 160 

get gac tac ctg gac aac cac aag cgc ttc gec ctg ttc tgc aag gee 528 
Ala Asp Tyr Leu Asp Asn His Lys Arg Phe Ala Leu Phe Cys Lys Ala 
165 170 175 

get att gag get gec cgc gtg ctg ccc ttc ggc ccc ggc gag gac tgc 576 
Ala He Glu Ala Ala Arg Val Leu Pro Phe Gly Pro Gly Glu Asp Cys 
180 185 190 

gtc ttc gtg gec aac gac tgg cac tec gee ctg gtg ccc gtc ctg ctg 624 
Val Phe Val Ala Asn Asp Trp His Ser Ala Leu Val Pro Val Leu Leu 
195 200 205 

aag gac gag tac cag ccc aag ggc cag ttc acc aag gec aag teg gtg 672 
Lys Asp Glu Tyr Gin Pro Lys Gly Gin Phe Thr Lys Ala Lys Ser Val 
210 215 220 



ctg get ate cac aac ate gee ttc cag ggc cgc atg tgg gag gag get 
Leu Ala He His Asn He Ala Phe Gin Gly Arg Met Trp Glu Glu Ala 



225 230 235 



240 



720 



ttc aag gac acg aag ctg ccc cag gee gee ttt gac aag ctg gee ttc 768 
Phe Lys Asp Thr Lys Leu Pro Gin Ala Ala Phe Asp Lys Leu Ala Phe 
245 250 255 

teg gac ggc tat gee aag gtt tac act gag gee acc ccc atg gag gag 816 
Ser Asp Gly Tyr Ala Lys Val Tyr Thr Glu Ala Thr Pro Met Glu Glu 
260 265 270 

gac gag aag ccc ccg ctg acg gga aag acc tac aag aag ate aac tgg 864 
Asp Glu Lys Pro Pro Leu Thr Gly Lys Thr Tyr Lys Lys He Asn Trp 
275 280 285 

ctg aag ggt ggc att ate gec gec gac aag ctg gtg act gtg teg ccc 912 
Leu Lys Gly Gly He He Ala Ala Asp Lys Leu Val Thr Val Ser Pro 
290 295 300 

aac tac gcg acc gag ate get gec gat gee gee ggc ggt gtg gag ctg 960 
Asn Tyr Ala Thr Glu He Ala Ala Asp Ala Ala Gly Gly Val Glu Leu 
'305 310 315 ' 320 

gac acc gtc ate cgc gee aag ggc att gag ggc att gtg aac ggc atg 1008 
Asp Thr Val He Arg Ala Lys Gly He Glu Gly He Val Asn Gly Met 
325 330 335 

gac att gag gag tgg aac ccc aag acc gac aag ttc ctg tct gcg ccc 1056 
Asp He Glu Glu Trp Asn Pro Lys Thr Asp Lys Phe Leu Ser Ala' Pro 
340 345 350 
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tac gac cag 
Tyr Asp Gin 
355 

ctg cag gcc 
Leu Gin Ala 
370 

gcc ttc ate 
Ala Phe He 
385 

gcc gcc ctg 
Ala Ala Leu 



ctg ggt acc 
Leu Gly Thr 



acc aag tac 
Thr Lys Tyr 
435 

ctg gcg cac 
Leu Ala His 
450 

cgc ttc gag 
Arg Phe Glu 
465 

acc gtg ccc 
Thr Val Pro 



gag ggc gtc 
Glu Gly Val 



gac gag get 
Asp Glu Ala 
515 

gag gtg ttt 
Glu Val Phe 
530 

age cag gac 
Ser Gin Asp 
545 

ctg gag gag 
Leu Glu Glu 



gag gag ate 
Glu Glu He 



aac age gtc 
Asn Ser Val 



gag ctg ggc 
Glu Leu Gly 



ggc cgc ctg 
Gly Arg Leu 
390 

ccc aag ate 
Pro Lys lie 
405 

ggc aag gcc 
Gly Lys Ala 
420 

aag ggc cgc 
Lys Gly Arg 



atg etc acc 
Met Leu Thr 



ccc tgc ggc 
Pro Cys Gly 
470 

gtg gta gcc 
Val Val Ala 
485 

acc ggc ttc 
Thr Gly Phe 
500 

gac gcc gac 
Asp Ala Asp 



gcg ggc ggc 
Ala Gly Gly 



ctg tec tgg 
Leu Ser Trp 
550 

gtg gtg tac 
Val Val Tyr 
565 

aag gtg ccc 
Lys Val Pro 
580 



tac gcc ggc 
Tyr Ala Gly 
360 

ctg cct gtg 
Leu Pro Val 
375 

gag gag cag 
Glu Glu Gin 



ctg gcc acc 
Leu Ala Thr 



gcc tac gag 
Ala Tyr Glu 
425 

gcc aag ggc 
Ala Lys Gly 
440 

gcc ggc gcc 
Ala Gly Ala 
455 

ctg ate cag 
Leu He Gin 



tec acc ggc 
Ser Thr Gly 



cac atg ggc 
His Met Gly 
505 

gcc ctg gcc 
Ala Leu Ala 
520 

cgc tac ccc 
Arg Tyr Pro 
535 

tec aag ccc 
Ser Lys Pro 



ggc aag ggc 
Gly Lys Gly 



gtt gcc gag 
Val Ala Glu 
585 



aag gcc gcc 
Lys Ala Ala 



gac ccc acc 
Asp Pro Thr 
380 

aag ggt gtg 
Lys Gly Val 
395 

ccc aag gtg 
Pro Lys Val 
410 

aag ctg gtg 
Lys Leu Val 



gtg gtc aag 
Val Val Lys 



gac ttc atg 
Asp Phe Met 
460 

ctg cac gcc 
Leu His Ala 
475 

ggc ctg gtc 
Gly Leu Val 
490 

gcc ctg aac 
Ala Leu Asn 



gcc acc gtg 
Ala Thr Val 



gag atg gtg 
Glu Met Val 
540 

gcc cag aag 
Ala Gin Lys 
555 

ggc gtg gcc 
Gly Val Ala 
570 

aag ate ccc 
Lys He Pro 



gcc aag gag 
Ala Lys Glu 
365 

gcc ccc ctg 
Ala Pro Leu 



gac ate ate 
Asp He lie 



cag ate gcc 
Gin He Ala 
415 

aac gee ate 
Asn Ala He 
430 

ttc teg gcg 
Phe Ser Ala 
445 

ctg gtg ccc 
Leu Val Pro 



atg cac tac 
Met His Tyr 



gac acc gtc 
Asp Thr Val 
495 

ccc gac aag 
Pro Asp Lys 
510 

cgc cgt gcc 
Arg Arg Ala 
525 

gcc aac tgc 
Ala Asn Cys 



tgg gag ggc 
Trp Glu Gly 



acc gcc aag 
Thr Ala Lys 
575 

ggc gac ctg 
Gly Asp Leu 
590 



gcc 1104 
Ala 



ttc 1152 
Phe 



ctg 1200 

Leu 

400 

ate 1248 
He 



ggc 1296 
Gly 



ccc 1344 
Pro 



teg 1392 
Ser 



ggt 1440 

Gly 

480 

aag 1488 
Lys 



ctg 1536 
Leu 



age 1584 
Ser 



ate 1632 
He 



ctg 1680 

Leu 

560 

aag 1728 
Lys 



ccc 1776 
Pro 
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gcc gtg tec 
Ala Val Ser 
595 

gag ggc aac 
Glu Gly Asn 
610 

atg ggc gcg 
Met Gly Ala 
625 

gcc acc ccc 
Ala Thr Pro 



aag ccc aag 
Lys Pro Lys 



teg acc teg 
Ser Thr Ser 
675 

teg gcc tec 
Ser Ala Ser 
690 

cgc aag tec 
Arg Lys Ser 
705 



tac gcc ccc 
Tyr Ala Pro 



ggc gcc gcc 
Gly Ala Ala 



tgg cgc gcg 
Trp Arg Ala 
630 

aag gtg acc 
Lys Val Thr 
€45 

acc get ggc 
Thr Ala Gly 
660 

gag aac ggc 
Glu Asn Gly 



aag acc teg 
Lys Thr Ser 



gcc 
Ala 



aac acc ctg 
Asn Thr Leu 
600 

gcg ccc aag 
Ala Pro Lys 
615 

acc acc ccc 
Thr Thr Pro 



acc tac aag 
Thr Tyr Lys 



etc aag ctg 
Leu Lys Leu 
665 

get gcc tec 
Ala Ala Ser 
680 

get gcc aag 
Ala Ala Lys 
695 



aag ccc gtg 
Lys Pro Val 



gtc ggc acc 
Val Gly Thr 
620 

teg ggc ccc 
Ser Gly Pro 
635 

ccc gcc ctg 
Pro Ala Leu 
650 

gcc ggt gag 
Ala Gly Glu 



aac ggc aac 
Asn Gly Asn 



ccc ctg gtc 
Pro Leu Val 
700 



tec gcc tec 
Ser Ala Ser 
605 

acc gcc ccc 
Thr Ala Pro 



teg ccc gcc 
Ser Pro Ala 



ccc gcc acc 
Pro Ala Thr 
655 

gcc tec acc 
Ala Ser Thr 
670 

ggc aac ggt 
Gly Asn Gly 
685 

tec gcc gcc 
Ser Ala Ala 



gtg 1824 
Val 



gcc 1872 
Ala 



gcc 1920 

Ala 

640 

gcc 1968 
Ala 



acc 2016 
Thr 



gcc 2064 
Ala 



acc 2112 
Thr 



2124 



<210> 3 
<211> 708 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: fragment 

of the complete sequence of cDNA coding for the GBSSI 
of Chlamydoiuonas reinhardtii 

<400> 3 

Met Ala Val Ala Ser Thr Ser Arg Pro Ser Ser Ala Arg Pro lie Val 
1 5 10 ' 15 

He Asn Ala Ala Ser Phe Gly Val Lys Lys Thr Ala Asn Gin Leu Leu 
20 25 30 

Arg Glu Leu Ala Arg Gly Ser Ala Arg Lys Ser Thr Ser Arg Ser Ala 
35 40 45 

Val Thr Gly Ala Thr Gly Ala Thr Cys Ala Leu Asp He Val Met Val 
50 55 60 

Ala Ala Glu Val Ala Pro Trp Ser Lys Thr Gly Gly Leu Gly Asp Val 
65 70 75 80 



Thr Gly Gly Leu Pro He Glu Leu Val Lys Arg Gly His Arg Val Met 
85 „ 90 95 
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Thr lie Ala Pro 
100 

Val Val Val Asp 
115 

Lys Lys Gly Val 
130 

Lys Val Trp Gly 
145 

Ala Asp Tyr Leu 



Ala lie Glu Ala 
180 

Val Phe Val Ala 
195 

Lys Asp Glu Tyr 
210 

Leu Ala lie His 
225 

Phe Lys Asp Thr 



Ser Asp Gly Tyr 
260 

Asp Glu Lys Pro 
275 

Leu Lys Gly Gly 
290 

Asn Tyr Ala Thr 
305 

Asp Thr Val lie 



Asp lie Glu Glu 
340 

Tyr Asp Gin Asn 
355 

Leu Gin Ala Glu 
370 

Ala Phe lie Gly 
385 

Ala Ala Leu Pro 



Arg Tyr Asp Gin 



He Met Gly Glu 
120 

His Arg Val Trp 
135 

Lys Thr Gly Ser 
150 

Asp Asn His Lys 
165 

Ala Arg Val Leu 



Asn Asp Trp His 
200 

Gin Pro Lys Gly 
215 

Asn He Ala Phe 
230 

Lys Leu Pro Gin 
245 

Ala Lys Val Tyr 



Pro Leu Thr Gly 
280 

He He Ala Ala 
295 

Glu He Ala Ala 
310 

Arg Ala Lys Gly 
325 

Trp Asn Pro Lys 



Ser Val Tyr Ala 
360 

Leu Gly Leu Pro 
375 

Arg Leu Glu Glu 
390 

Lys He Leu Ala 
405 



Tyr Ala Asp Ala 
105 

Lys Val Arg Tyr 



He Asp His Pro 
140 

Lys Leu Tyr Gly 
155 

Arg Phe Ala Leu 
170 

Pro Phe Gly Pro 
185 

Ser Ala Leu Val 



Gin Phe Thr Lys 
220 

Gin Gly Arg Met 
235 

Ala Ala Phe Asp 
250 

Thr Glu Ala Thr 
265 

Lys Thr Tyr Lys 



Asp Lys Leu Val 
300 

Asp Ala Ala Gly 
315 

He Glu Gly He 
330 

Thr Asp Lys Phe 
345 

Gly Lys Ala Ala 



Val Asp Pro Thr 
380 

Gin Lys Gly Val 
395 

Thr Pro Lys Val 
,410 



Trp Asp Thr Ser 
110 

Phe His Ser He 
125 

Trp Phe Leu Ala 



Pro Arg Ser Gly 
160 

Phe Cys Lys Ala 
175 

Gly Glu Asp Cys 
190 

Pro Val Leu Leu 
205 

Ala Lys Ser Val 



Trp Glu Glu Ala 
240 

Lys Leu Ala Phe 
255 

Pro Met Glu Glu 
270 

Lys He Asn Trp 
285 

Thr Val Ser Pro 



Gly Val Glu Leu 
320 

Val Asn Gly Met 
' 335 

Leu Ser Ala Pro 
350 

Ala Lys Glu Ala 
365 

Ala Pro Leu Phe 



Asp He He Leu 
400 

Gin He Ala He 

415 
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Leu Gly Thr Gly Lys Ala Ala Tyr Glu Lys Leu Val Asn Ala He Gly 



420 



425 



430 



Thr Lys Tyr Lys Gly Arg Ala Lys Gly Val Val Lys Phe Ser Ala Pro 
435 440 445 

Leu Ala His Met Leu Thr Ala Gly Ala Asp Phe Met Leu Val Pro Ser 
450 455 460 

Arg Phe Glu Pro Cys Gly Leu He Gin Leu His Ala Met His Tyr Gly 
465 470 475 480 

Thr Val Pro Val Val Ala Ser Thr Gly Gly Leu Val Asp Thr Val Lys 
485 490 495 

Glu Gly Val Thr Gly Phe His Met Gly Ala Leu Asn Pro Asp Lys Leu 
500 505 510 

Asp Glu Ala Asp Ala Asp Ala Leu Ala Ala Thr Val Arg Arg Ala Ser 
515 520 525 

Glu Val Phe Ala Gly Gly Arg Tyr Pro Glu Met Val Ala Asn Cys He 
530 535 540 

Ser Gin Asp Leu Ser Trp Ser Lys Pro Ala Gin Lys Trp Glu Gly Leu 
545 550 555 560 

Leu Glu Glu Val Val Tyr Gly Lys Gly Gly Val Ala Thr Ala Lys Lys 
565 570 575 

Glu Glu He Lys Val Pro Val Ala Glu Lys He Pro Gly Asp Leu Pro 
580 585 590 

Ala Val Ser Tyr Ala Pro Asn Thr Leu Lys Pro Val Ser Ala Ser Val 
595 600 605 

Glu Gly Asn Gly Ala Ala Ala Pro Lys Val Gly Thr Thr Ala Pro Ala 
610 615 620 

Met Gly Ala Trp Arg Ala Thr Thr Pro Ser Gly Pro Ser Pro Ala Ala 
625 630 635 640 

Ala Thr Pro Lys Val Thr Thr Tyr Lys Pro Ala Leu Pro Ala Thr Ala 
645 650 ' 655 

Lys Pro Lys Thr Ala Gly Leu Lys Leu Ala Gly Glu Ala Ser Thr Thr 
660 665 670 

Ser Thr Ser Glu Asn Gly Ala Ala Ser Asn Gly Asn Gly Asn Gly Ala 
675 680 685 

Ser Ala Ser Lys Thr Ser Ala Ala Lys Pro Leu Val Ser Ala Ala Thr 
690 695 700 



Arg Lys Ser Ala 
705 
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<210> 4 
<211> 1953 
<212> DNA 

<213> Artificial sequence 



<220> 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI of 
Chlamydomonas reinhardtii and coding for the mature GBSSI 
protein. 



<220> 

<221> CDS 

<222> (1) . . (1953) 



<400> 4 

gcg ctg gac ate gtg atg gtt get get gag gtc gec cct tgg tec aag 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 

1 *5 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 

20 25 30 



48 



96 



aag cgc ggc cac cgc gtc atg ace att gec cct cgc tac gac cag tac 
Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac ace teg gtg gtc gtg gac ate atg ggc gag aag 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 
Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

gac cac ccc tgg ttc ctg gee aag gtc tgg ggc aag ace ggc tec aag 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 HO 

ttc gee ctg ttc tgc aag gec get att gag get gee cgc gtg ctg ccc 
Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gee aac gac tgg cac tec 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gec ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

ttc ace aag gec aag teg gtg ctg get ate cac aac ate gec ttc cag 
Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 



144 



192 



240 



288 



336 



384 



432 



480 



528 
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ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gec 576 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

gec ttt gac aag ctg gec ttc teg gac ggc tat gec aag gtt tac act . 624 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gec acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gec gee gac 720 
Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 - 230 235 - 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gee gat 7 68 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

gec gec ggc ggt gtg gag ctg gac acc gtc ate cgc gee aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 864 
Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gee ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gee gec gec aag gag gee ctg cag gee gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc acc gee ccc ctg ttc gee ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gec gee ctg ccc aag ate ctg gee acc 1056 
Lys Gly Val Asp lie lie Leu Ala Ala Leu Pro Lys lie Leu Ala Thr 
340 345 356 

ccc aag gtg cag ate gec ate ctg ggt acc ggc aag gee gee tac gag 1104 
Pro Lys Val Gin lie Ala lie Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

aag ctg gtg aac gee ate ggc acc aag tac aag ggc cgc gee aag ggc 1152 
Lys Leu Val Asn Ala lie Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gec ggc gec 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu lie Gin 
405 410 415 
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ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc 1344 
Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 

gcc ctg aac ccc gac aag ctg gac gag get gac gcc gac gcc ctg gcc 1392 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 144 0 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 - 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 
Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc ggc gac ctg ccc gcc gtg tec tac gcc ccc aac acc ctg 1632 
Lys He Pro Gly Asp Leu Pro Ala Val Ser Tyr Ala Pro Asn Thr Leu 
530 535 540 

aag ccc gtg tec gcc tec gtg gag ggc aac ggc gcc gcc gcg ccc aag 1680 
Lys Pro Val Ser Ala Ser Val Glu Gly Asn Gly Ala Ala Ala Pro Lys 
545 550 555 560 

gtc ggc acc acc gcc ccc gcc atg ggc gcg tgg cgc gcg acc acc ccc 1728 
Val Gly Thr Thr Ala Pro Ala Met Gly Ala Trp Arg Ala Thr Thr Pro 
565 570 575 

teg ggc ccc teg ccc gcc gcc gcc acc ccc aag gtg acc acc tac aag 1776 
Ser Gly Pro Ser Pro Ala Ala Ala Thr Pro Lys Val Thr Thr Tyr Lys 
580 585 590 

ccc gcc ctg ccc gcc acc gcc aag ccc aag acc get ggc etc aag ctg 1824 
Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

gcc ggt gag gcc tec acc acc teg acc teg gag aac ggc get gcc tec 1872 
Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

aac ggc aac ggc aac ggt gcc teg gcc tec aag acc teg get gcc aag 1920 
Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 

ccc ctg gtc tec gcc gcc acc cgc aag tec gcc 1953 
Pro Leu Val Ser Ala Ala Thr Arg Lys £er Ala 
645 650 
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<210> 5 
<211> 651 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii and coding for the mature 
GBSSI protein 

<400> 5 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 "■ 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 ' 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly He He Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu He Ala Ala Asp 
245 250 255 



Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 
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Glu Gly lie Val 
275 

Asp Lys Phe Leu 

290 

Lys Ala Ala Ala 
305 

Asp Pro Thr Ala 



Lys Gly Val Asp 
340 

Pro Lys Val Gin 
355 

Lys Leu Val Asn 
370 

Val Val Lys Phe 
385 

Asp Phe Met Leu 



Leu His Ala Met 
420 

Gly Leu Val Asp 
435 

Ala Leu Asn Pro 
450 

Ala Thr Val Arg 
465 

Glu Met Val Ala 



Ala Gin Lys Trp 
500 

Gly Val Ala Thr 
515 

Lys lie Pro Gly 
530 

Lys Pro Val Ser 
545 

Val Gly Thr Thr 



Ser Gly Pro Ser 
580 



Asn Gly Met Asp 
280 

Ser Ala Pro Tyr 
295 

Lys Glu Ala Leu 
310 

Pro Leu Phe Ala 
325 

lie lie Leu Ala 



lie Ala lie Leu 
360 

Ala lie Gly Thr 
375 

Ser Ala Pro Leu 
390 

Val Pro Ser Arg 
405 

His Tyr Gly Thr 



Thr Val Lys Glu 
440 

Asp Lys Leu Asp 
455 

Arg Ala Ser Glu 
470 



Asn Cys lie Ser 
485 

Glu Gly Leu Leu 



Ala Lys Lys Glu 
520 

Asp Leu Pro Ala 
535 

Ala Ser Val Glu 
550 

Ala Pro Ala Met 
565 

Pro Ala Ala Ala 



lie Glu Glu Trp 



Asp Gin Asn Ser 
300 

Gin Ala Glu Leu 
315 

Phe lie Gly Arg 
330 

Ala Leu Pro Lys 
345 

Gly Thr Gly Lys 



Lys Tyr Lys Gly 
380 

Ala His Met Leu 
395 

Phe Glu Pro Cys 
410 

Val Pro Val Val 
425 

Gly Val, Thr Gly 



Glu Ala Asp Ala 
460 

Val Phe Ala Gly 
475 

Gin Asp Leu Ser 
490 

Glu Glu Val Val 
505 

Glu lie Lys Val 



Val Ser Tyr Ala 
540 

Gly Asn Gly Ala 
555 

Gly Ala Trp Arg 
570 

Thr Pro Lys Val 
585 



Asn Pro Lys Thr 
285 

Val Tyr Ala Gly 



Gly Leu Pro Val 
320 

Leu Glu Glu Gin 
335 

lie Leu Ala Thr 
350 

Ala Ala Tyr Glu 
365 

Arg Ala Lys Gly 



Thr Ala Gly Ala 
400 

Gly Leu He Gin 
415 

Ala Ser Thr Gly 
430 

Phe His Met Gly 
445 

Asp Ala Leu Ala 



Gly Arg Tyr Pro 
480 

Trp Ser Lys Pro 
495 

Tyr Gly Lys Gly 
510 

Pro Val Ala Glu 
525 

Pro Asn Thr Leu 



Ala Ala Pro Lys 
560 

Ala Thr Thr Pro 
575 

Thr Thr Tyr Lys 
590 
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Pro Ala Leu Pro Ala Thr Ala Lys Pro Lys Thr Ala Gly Leu Lys Leu 
595 600 605 

Ala Gly Glu Ala Ser Thr Thr Ser Thr Ser Glu Asn Gly Ala Ala Ser 
610 615 620 

Asn Gly Asn Gly Asn Gly Ala Ser Ala Ser Lys Thr Ser Ala Ala Lys 
625 630 635 640 



Pro Leu Val Ser Ala Ala Thr Arg Lys Ser Ala 
645 650 



<210> 6 
<211> 1314 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1) . . (1314) 

<400> 6 

gcg ctg gac ate gtg atg gtt get get gag gtc gee cct tgg tec aag 48 

Ala Leu Asp lie Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 

15 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro lie Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gec cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr lie Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp lie Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser lie Lys Lys Gly Val His Arg Val Trp lie 
65 70 75 80 

gac cac ccc tgg ttc ctg gec aag gtc tgg ggc aag acc ggc tec aag 288 
Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 



ttc gec ctg ttc tgc aag gee get att gag get gec cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala lie Glu Ala Ala Arg Val Leu Pro 
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120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gcc aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gcc ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag * 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 



160 



ttc acc aag gcc aag teg gtg ctg get ate cac aac ate gcc ttc cag 
Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 



528 



ggc cgc atg tgg .gag gag get ttc aag gac acg aag ctg ccc cag gcc 57 6 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

gcc ttt gac aag ctg gcc ttc teg gac ggc tat gcc aag gtt tac act 624 
Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gcc acc ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gcc gac 720 
Thr Tyr Lys Lys He Asn Trp Leu Lys Gly Gly He He Ala Ala Asp 
225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gcc gat 768 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu He Ala Ala Asp 
245 250 255 

gcc gcc ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 8 64 
Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gcc aag gag gcc ctg cag gcc gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc acc gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gcc acc 1056 
Lys Gly Val Asp He He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

ccc aag gtg cag ate gcc ate ctg ggt acc ggc aag gcc gcc tac gag 1104 
Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 
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aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gec aag ggc 
Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 



1152 



gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gcc ggc gcc 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 



1200 



gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 415 



1248 



ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 
Leu His Ala Met Mis Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 



1296 



ggc ctg gtc gac acc gtc 
Gly Leu Val Asp Thr Val 
435 



1314 



<210> 7 
<211> 438 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<400> 7 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
6 5 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 * 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 
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Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys .-He Asn Trp Leu Lys Gly Gly He He Ala Ala Asp 
225 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu He Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp He He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 ' 430 

Gly Leu Val Asp Thr Val 
435 



<210> 8 
<211> 1593 
<212> DNA 
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<213> Artificial sequence 
<220> 

<223> Description of the artificial sequencer fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<220> 

<221> CDS 

<222> (1) . . (1593) 

<400> 8 

gcg ctg gac ate gtg atg gtt get get gag gtc gec cct tgg tec aag 48 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
1 ■; 5 10 15 

acg ggc ggc ctg ggc gat gtg act ggt ggc ctg cct att gag ctg gtc 96 
Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

aag cgc ggc cac cgc gtc atg acc att gee cct cgc tac gac cag tac 144 
Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

get gac gec tgg gac acc teg gtg gtc gtg gac ate atg ggc gag aag 192 
Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

gtc cgc tac ttc cac tec ate aag aag ggc gtg cac cgc gtg tgg att 240 
Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

gac cac ccc tgg ttc ctg gec aag gtc tgg ggc aag acc ggc tec aag 288 
Asp His Pro Trp Phe Leu "Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

ctg tac ggc ccc cgc tec ggc get gac tac ctg gac aac cac aag cgc 336 
Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

ttc gee ctg ttc tgc aag gec get att gag get gec cgc gtg ctg ccc 384 
Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 125 

ttc ggc ccc ggc gag gac tgc gtc ttc gtg gec aac gac tgg cac tec 432 
Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 
130 135 140 

gec ctg gtg ccc gtc ctg ctg aag gac gag tac cag ccc aag ggc cag 480 
Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 ^ 160 

ttc acc aag gec aag teg gtg ctg get ate cac aac ate gec ttc cag 528 
Phe Thr Lys Ala Lys Ser Val Leu Ala He His Asn He Ala Phe Gin 
165 170 175 

ggc cgc atg tgg gag gag get ttc aag gac acg aag ctg ccc cag gec 57 6 
Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 . 190 



gec ttt gac aag ctg gee ttc teg gac ggc tat gec aag gtt tac act 624 
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Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

gag gcc ace ccc atg gag gag gac gag aag ccc ccg ctg acg gga aag 672 
Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

acc tac aag aag ate aac tgg ctg aag ggt ggc att ate gcc gcc gac 720 
Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 230 235 240 

aag ctg gtg act gtg teg ccc aac tac gcg acc gag ate get gcc gat 768 
Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

gcc gcc ggc ggt gtg gag ctg gac acc gtc ate cgc gcc aag ggc att 816 
Ala Ala Gly Gly Val Glu Leu Asp Thr Val lie Arg Ala Lys Gly lie 
260 t 265 270 

gag ggc att gtg aac ggc atg gac att gag gag tgg aac ccc aag acc 8 64 
Glu Gly lie Val Asn Gly Met Asp lie Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

gac aag ttc ctg tct gcg ccc tac gac cag aac age gtc tac gcc ggc 912 
Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

aag gcc gcc gcc aag gag gcc ctg cag gcc gag ctg ggc ctg cct gtg 960 
Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

gac ccc acc gcc ccc ctg ttc gcc ttc ate ggc cgc ctg gag gag cag 1008 
Asp Pro Thr Ala Pro Leu Phe Ala Phe lie Gly Arg Leu Glu Glu Gin 
325 330 335 

aag ggt gtg gac ate ate ctg gcc gcc ctg ccc aag ate ctg gcc acc 105 6 
Lys Gly Val Asp He He Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

ccc aag gtg cag ate gcc ate ctg ggt acc ggc aag gcc gcc tac gag 1104 
Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

aag ctg gtg aac gcc ate ggc acc aag tac aag ggc cgc gcc aag ggc 1152 
Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

gtg gtc aag ttc teg gcg ccc ctg gcg cac atg etc acc gcc ggc gcc 1200 
Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

gac ttc atg ctg gtg ccc teg cgc ttc gag ccc tgc ggc ctg ate cag 1248 
Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 415 

ctg cac gcc atg cac tac ggt acc gtg ccc gtg gta gcc tec acc ggc 1296 
Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

ggc ctg gtc gac acc gtc aag gag ggc gtc acc ggc ttc cac atg ggc 1344 
Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 



19 



435 440 445 

gcc ctg aac ccc gac aag ctg gac gag get gac gec gac gec ctg gec 1392 
Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 

gcc acc gtg cgc cgt gcc age gag gtg ttt gcg ggc ggc cgc tac ccc 1440 
Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

gag atg gtg gcc aac tgc ate age cag gac ctg tec tgg tec aag ccc 1488 
Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

gcc cag aag tgg gag ggc ctg ctg gag gag gtg gtg tac ggc aag ggc 1536 
Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 

ggc gtg gcc acc gcc aag aag gag gag ate aag gtg ccc gtt gcc gag 1584 
Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 

aag ate ccc 1593 
Lys lie Pro 
530 



<210> 9 
<211> 531 
<212> PRT 

<213> Artificial sequence 

<223> Description of the artificial sequence: fragment 
of the complete cDNA coding for the GBSSI 
of Chlamydomonas reinhardtii 

<400> 9 

Ala Leu Asp He Val Met Val Ala Ala Glu Val Ala Pro Trp Ser Lys 
15 10 15 

Thr Gly Gly Leu Gly Asp Val Thr Gly Gly Leu Pro He Glu Leu Val 
20 25 30 

Lys Arg Gly His Arg Val Met Thr He Ala Pro Arg Tyr Asp Gin Tyr 
35 40 45 

Ala Asp Ala Trp Asp Thr Ser Val Val Val Asp He Met Gly Glu Lys 
50 55 60 

Val Arg Tyr Phe His Ser He Lys Lys Gly Val His Arg Val Trp He 
65 70 75 80 

Asp His Pro Trp Phe Leu Ala Lys Val Trp Gly Lys Thr Gly Ser Lys 
85 90 95 

Leu Tyr Gly Pro Arg Ser Gly Ala Asp Tyr Leu Asp Asn His Lys Arg 
100 105 110 

Phe Ala Leu Phe Cys Lys Ala Ala He Glu Ala Ala Arg Val Leu Pro 
115 120 * 125 



Phe Gly Pro Gly Glu Asp Cys Val Phe Val Ala Asn Asp Trp His Ser 



20 



130 135 140 

Ala Leu Val Pro Val Leu Leu Lys Asp Glu Tyr Gin Pro Lys Gly Gin 
145 150 155 160 

Phe Thr Lys Ala Lys Ser Val Leu Ala lie His Asn lie Ala Phe Gin 
165 170 175 

Gly Arg Met Trp Glu Glu Ala Phe Lys Asp Thr Lys Leu Pro Gin Ala 
180 185 190 

Ala Phe Asp Lys Leu Ala Phe Ser Asp Gly Tyr Ala Lys Val Tyr Thr 
195 200 205 

Glu Ala Thr Pro Met Glu Glu Asp Glu Lys Pro Pro Leu Thr Gly Lys 
210 215 220 

Thr Tyr Lys Lys lie Asn Trp Leu Lys Gly Gly lie lie Ala Ala Asp 
225 ' 230 235 240 

Lys Leu Val Thr Val Ser Pro Asn Tyr Ala Thr Glu lie Ala Ala Asp 
245 250 255 

Ala Ala Gly Gly Val Glu Leu Asp Thr Val He Arg Ala Lys Gly He 
260 265 270 

Glu Gly He Val Asn Gly Met Asp He Glu Glu Trp Asn Pro Lys Thr 
275 280 285 

Asp Lys Phe Leu Ser Ala Pro Tyr Asp Gin Asn Ser Val Tyr Ala Gly 
290 295 300 

Lys Ala Ala Ala Lys Glu Ala Leu Gin Ala Glu Leu Gly Leu Pro Val 
305 310 315 320 

Asp Pro Thr Ala Pro Leu Phe Ala Phe He Gly Arg Leu Glu Glu Gin 
325 330 335 

Lys Gly Val Asp He lie Leu Ala Ala Leu Pro Lys He Leu Ala Thr 
340 345 350 

Pro Lys Val Gin He Ala He Leu Gly Thr Gly Lys Ala Ala Tyr Glu 
355 360 365 

Lys Leu Val Asn Ala He Gly Thr Lys Tyr Lys Gly Arg Ala Lys Gly 
370 375 380 

Val Val Lys Phe Ser Ala Pro Leu Ala His Met Leu Thr Ala Gly Ala 
385 390 395 400 

Asp Phe Met Leu Val Pro Ser Arg Phe Glu Pro Cys Gly Leu He Gin 
405 410 ' 415 

Leu His Ala Met His Tyr Gly Thr Val Pro Val Val Ala Ser Thr Gly 
420 425 430 

Gly Leu Val Asp Thr Val Lys Glu Gly Val Thr Gly Phe His Met Gly 
435 440 445 



Ala Leu Asn Pro Asp Lys Leu Asp Glu Ala Asp Ala Asp Ala Leu Ala 
450 455 460 
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Ala Thr Val Arg Arg Ala Ser Glu Val Phe Ala Gly Gly Arg Tyr Pro 
465 470 475 480 

Glu Met Val Ala Asn Cys lie Ser Gin Asp Leu Ser Trp Ser Lys Pro 
485 490 495 

Ala Gin Lys Trp Glu Gly Leu Leu Glu Glu Val Val Tyr Gly Lys Gly 
500 505 510 



Gly Val Ala Thr Ala Lys Lys Glu Glu lie Lys Val Pro Val Ala Glu 
515 520 525 



Lys lie Pro 
530 



